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RECENT RESEARCH ON HUMAN PROBLEM SOLVING! 


CARL P. DUNCAN 
Northwestern University 


The present review summarizes 
most studies of human problem solv- 
ing that were published in the period 
1946 through 1957. A complete re- 
view of the literature on human prob- 
lem solving would have to include 
studies in which problem solving 
tasks were used in research on the 
subject variable of rigidity. Since 
Chown (1959) has recently published 
an extensive review of rigidity, the 
studies of problem solving which are 
cited in her paper will not be covered 
here. In the case of topics where 
Chown has summarized some of the 
relevant studies, her paper will be 
cited along with other pertinent in- 
vestigations. 

Within the area of thinking, the 
present review covers only experi- 
mental and theoretical studies that 
dealt with the problem solving per- 
formances of normal human adults. 
Thus, the scope of the paper is nar- 
rower than that of other recent re- 
views (Humphrey, 1951; Johnson: 
1950, 1955; Russell, 1956; van de 
Geer, 1957; Vinacke, 1952). 


DEFINITIONS 


Attempts to define thinking in gen- 
eral or problem solving in particular 
appear most clearly in the writings 
of Humphrey (1951), Johnson (1955), 
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Maltzman (1955), Ray (1955), Rus- 
sell (1956), Underwood (1952), van 
de Geer (1957), and Vinacke (1952). 
The defining characteristics most fre- 
quently mentioned are the integra- 
tion and organization of past experi- 
ence when the definition refers to all 
of thinking, and the dimension of dis- 
covery of correct response when ref- 
erence is made to problem solving 
specifically. Problem solving is con- 
sidered to be fairly high on the dis- 
covery dimension, as one way of dis- 
tinguishing it from conditioning and 
rote learning which are presumed to 
involve relatively little response dis- 
covery. Underwood (1952) gives 
three methods for determining the 
amount of overlap between condi- 
tioning and thinking. 

It is of interest to note that nearly 
all writers concerned with definitions 
emphasized that they were trying to 
define thinking or problem solving in 
such a way as to relate them to, not 
separate them from, simpler proc- 
esses like learning or perception. 
Maltzman (1955) and a few others 
distinguish between productive and 
reproductive processes within think- 
ing, but apparently no one any longer 
seriously defends a sharp distinction 
between higher and lower mental 
processes, particularly between think- 
ing and learning. That issues of this 
kind are not completely dead, how- 
ever, is indicated by the fact that van 
de Geer (1957) attempted to destroy 
the productive-reproductive distinc- 
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tion, and several other writers who 
were not primarily concerned with 
definitional problems also felt it nec- 
essary to state that thinking is part of 
learning or association (Cofer, 1957; 
Judson & Cofer, 1956; Judson, Cofer, 
& Gelfand, 1956; Saugstad, 1957; 
Weaver & Madden, 1949). 

A few other writers who have been 
somewhat concerned with problems 
of definition may be mentioned. In 
an extensive study of categorization 
and concept formation, Bruner, 
Goodnow, and Austin (1956) de- 
scribed broad classes of equivalence 
categories, one of which was “func- 
tional” categorization. The authors 
think this category includes at least 
those problem solving tasks where S 
must categorize an object as fitting a 
certain function, e.g., the pliers as a 
pendulum weight in the two-string 
problem. They also suggested that 
defining attributes are sometimes 


combined to create either new cate- 
gories or empty categories, and that 


these types of combination often oc- 
cur problem solving. These brief sug- 
gestions are worth noting if only be- 
cause they represent one of the few 
attempts in the literature to relate 
two major areas of thinking research, 
problem solving and concept forma- 
tion, other than by means of an all- 
inclusive definition (see also Under- 
wood, 1952). 

Galanter and Gerstenhaber (1956) 
define thinking in a way that seems 
to differ sharply from the usual defi- 
nitions (although the reviewer does 
not really understand their position), 
and Maltzman’s (1955) definition 
restricts thinking to articulate organ- 
isms. However, disagreement on 
definitions of either thinking or prob- 
lem solving is less than might be ex- 
pected; at least it was possible to 
hold a conference on human problem 
solving where some areas of agree- 
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ment were evident in the absence of 
a definition of the field (Hovland & 
Kendler, 1955). 

Any further pursuing of the issue 
of a definition of problem solving 
would lead into discussion of proc- 
esses in problem solving behavior, or 
into theory. Both of these topics can 
be handled better after the bulk of 
the empirical studies has been pre- 
sented. 

Most of the remainder of the 
paper is a review of empirical studies 
of human problem solving. Insofar as 
possible, the review is organized in 
terms of the independent variables 
that influence problem solving per- 
formance. The categorization of 
these variables that was finally de- 
cided on is not very satisfactory. In 
many cases investigators used highly 
specific variables or conditions and 
failed to suggest any similarity be- 
tween their variables and those used 
by other investigators. Thus, the re- 
viewer's categories are necessarily 
too arbitrary. 

Most of the studies to be reviewed 
seemed to fall into one of three major 
classes. In the first, the independent 
variables were introduced prior to 
testing on the final problem solving 
task, which task was the same for, 
and was presented under constant 
conditions to all Ss. These studies 
used what is essentially a training and 
transfer design. In the second group, 
the independent variables were intro- 
duced during work on the test prob- 
lem, or were changes in the problem 
itself. The third group contains 
studies where the variables were cer- 
tain characteristics of the Ss used. 
Some attempt will be made to differ- 
entiate subclasses of variables within 
each of these major groups. Other 
papers to be covered include studies 
of individual vs. group problem solv- 
ing, research on problem solving 
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processes, and contributions to the- 
ory. 


TRANSFER FOLLOWING VARIATIONS 
IN TRAINING 


Different Methods of Training 


Methods of ‘“‘understanding.’’ The 
first four studies reviewed here are 
similar in that all dealt more or less 
with transfer following training by 
memorization vs. training by various 
“‘understanding”’ methods. Hilgard, 
Irvine, and Whipple (1953), Hilgard, 
Edgren, and Irvine (1954), and Cran- 
nell (1956), all used Katona card 
tricks (Katona, 1940) as tasks; For- 
gus and Schwartz (1957) used various 
arrangements of letters. In all 
studies, different groups of Ss were 
first trained on problems solvable 
either by memorization (of, e.g., a 
certain order of cards or letters), or 
by learning, via one or more under- 
standing methods, a principle or 
technique presumably applicable to 
many such tasks. The differently- 
trained groups were then tested for 
recall of training problems, and for 
transfer to both simple and difficult 
new tasks. 

Hilgard et al. (both studies), and 
Crannell reported little or no differ- 
ence among methods on recall or on 
simple transfer tasks, but on more 
difficult transfer tasks certain under- 
standing methods, particularly the 
“Katona diagram,”’ produced super- 
ior performance. Forgus and 
Schwartz found that both demonstra- 
tion and discovery of the principle 
led to better performance than did 
memorization on all three tests. 

All four of the studies tested the 
same Ss successively on all three 
types of tests, so the results may have 
been affected by differential transfer 
effects among tests, or by varying in- 
teractions between particular train- 
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ing methods and particular tests. 
Also, the various training methods 
were complex, unanalyzed variables 
that are difficult to evaluate. For ex- 
ample, a principle may be a single 
item, suc); as a forrnula, in which case 
it is easily learned by an understand- 
ing group; Forgus and Schwartz 
found that their memorization group, 
which had to learn a series of items, 
required about twice as much prac- 
tice in original training as did under- 
standing groups. In contrast, the 
Katona diagram used by Hilgard et 
al. and Crannell, was an understand- 
ing method that required much 
original training. Further, an under- 
standing method may yield either 
positive or negative transfer depend- 
ing on the particular test task; some- 
thing like this apparently occurred 
with the “working backwards” 
method used by Hilgard et al. (1954), 
and Crannell. 

Hilgard et al. pointed to limited 
understanding of even an_ under- 
standing method as a source of error. 
The same point was made by Burack 
and Moos (1956), who found little 
transfer to solution of a mechanical 
puzzle from either verbal or actual 
presentation of illustrations of centri- 
fugal force, and by Székely (1950a) 
with problems requiring use of hydro- 
static principles. 

The point raised above that results 
may depend on the interaction be- 
tween a particular training method 
and a particular test task is illus- 
trated in Corman’s (1957) study. 
Groups given varying amounts of in- 
formation on how to attack Katona 
match problems produced more solu- 
tions than groups given varying 
amounts of information about the 
principle underlying all problems, 
whereas the latter groups did best 
when tested for ability to verbalize 
the principle. The problem of inter- 
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preting results when Ss are tested 
successively on a series of tasks was 
also noted. Although the training 
variables appeared to have signifi- 
cant effects on training, and on simple 
and complex transfer tasks, effects on 
both types of transfer tasks disap- 
peared when number of training 
problems attempted and solved were 
partialled out. 

Székely (1950b) reported that Ss 
trained on the principle of moment of 
ineritia by a ‘‘modern” method (first 
predict and watch demonstration of 
movements of a torsion pendulum, 
then read textbook material on me- 
chanics) did better on the two- 
spheres problem, which requires ap- 
plication of the principle, than did 
“‘traditional’’ method Ss (read text, 
then watch demonstration). But 
Maltzman, Eisman, and _ Brooks 
(1956) failed to duplicate this finding. 
Either method, or a combination of 
the methods, produced more solu- 
tions than a control group with no 
training, but there were no significant 
differences among the three experi- 
mental groups. 

Craig (1956) had Ss cross out the 
word that was unrelated to four other 
words; each such group of words 
utilized a different principle. The Ss 
who were told the principle applying 
to each block learned more during 
training than uninformed Ss, and, 
probably because of differential 
learning, retained more after 31 days. 
But on transfer to new items the 
groups did not differ, although both 
did better than they had on training 
items. 

In Buswell’s (1956) study of pat- 
terns of thinking, one experiment 
concerned the discovery of general- 
izations. The Ss were to discover a 
rule whereby they could get sums of 
ordered columns of numbers without 
simply adding. The Ss found the 
problem difficult, and had trouble 
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verbalizing the rule. On a test for 
transfer to similar problems, about 
half the Ss showed transfer. 

Other methods of training. Ray 
(1957) required Ss to state verbally 
what they were going to do before 
they were allowed to make motor re- 
sponses to a problem requiring turn- 
ing off a light with switches. This 
verbal work facilitated problem solu- 
tion, probably because, as was also 
shown, the verbal work increased S's 
tendency to respond systematically 
to elements of the problem. 

A specific systematic approach, the 
half-split technique, was taught to Ss 
by Goldbeck, Bernstein, Hillix, and 
Marx (1957); the technique was to be 
applied in a complex lights-and- 
switches apparatus problem. The 
technique was not particularly effec- 
tive until Ss were first taught the 
deductive skill of locating the ele- 
ments of the problem to which a sys- 
tematic approach could be applied; 
then the technique, as a device to 
improve efficiency, was an aid. 

Kendler and Kendler (1956) re- 
ported that 3 to 4-yr.-old children 
could make a correct inference when 
their training had included all of the 
separate part-tasks needed to make 
the inference. It is possible, however, 
that the children used body orienta- 
tion cues, in part, to make the cer- 
rect response. The Ss had to change 
their position with respect to the ap- 
paratus in order to learn one (pre- 
sumably the crucial one) of the neces- 
sary part-tasks. Significant infer- 
ential behavior occurred only when 
this part-task was the last one 
learned, i.e., immediately before the 
test trial. 

The preceding studies of various 
methods of training did not yield par- 
ticularly clear-cut results. However, 
the studies varied greatly from one 
another, and most dealt with rela- 
tively unanalyzed situations. Since 
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anything one learns probably has 
both positive and negative transfer- 
ring effects, depending on the situa- 
tion, more information is needed 
about what specific responses are 
learned under a particular training 
method, and what responses are re- 
quired on a particular transfer task. 
Also, more attention should be paid 
to the amount and breadth of train- 
ing, since a particular training method 
may yield positive transfer only if it 
is well learned in all its aspects. 


Amount of Training 


Except in studies of set (see later), 
very little research has been directly 
concerned with the effect of degree of 
original learning of responses which 
are expected to influence problem 
performance. This is surprising, since 
in most other learning situations both 
positive and negative transfer effects 
on a task are considerably influenced 
by variations in amount of practice 
on a similar training task. 

Three weeks prior to the problem 
solving session Marks (1951) gave 
some of his Ss a lecture which empha- 
sized analysis of a problem into its 
elements. The lectured group was 
not clearly better than the nonlec- 
tured group on a problem requiring 
finding errors in square roots, al- 
though a finer method of scoring solu- 
tions produced data indicating some 
superiority of the lectured group. 

French (1954) gave one group 
some preliminary training on a simp- 
ler version of a problem requiring 
turning off lights with buttons. This 
training greatly improved perform- 
ance on the final problem. More im- 
portantly, training interacted signifi- 
cantly with length-difficulty of the 
problem. With no prior training, the 
simple 4-item problem was much eas- 
ier to solve and learn than the 6-, 8-, 
or 10-item problems, which were 
clustered. But after training, the 4-, 
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6-, and 8-item problems were all 
solved about equally well, while 10 
items were still quite difficult. This 
shift in relative difficulty as a func- 
tion of prior training is an important 
finding that should be followed up. 
It also illustrates once again the 
point that results may depend upon 
interactions between particular train- 
ing methods and particular transfer 
tasks. 

Fattu and Mech (1953a) did one of 
the two experiments in which more 
than two amounts of training were 
employed. They compared groups 
given none, some, or much informa- 
tion about locating malfunctions in a 
gear train. Performance increased 
directly as a function of amount of 
training. Sato (1953) also compared 
groups given none, some, or much 
prior training with the characteristics 
of visual stimuli which were arranged 
in certain ways to provide problems. 
Difficulty of the problems was also 
varied. In general, differences in 
amount of training were significant 
for child Ss, but problem difficulty 
was more important for adult Ss. 

Although they performed no ex- 
periments, Bloom and Broder’s (1950) 
work suggests that problem solving 
proficiency may be improved by gen- 
eral training that is not tied to par- 
ticular kinds of problems. A general 
approach to problems (essentially a 
checklist) was developed from com- 
parisons of the problem solving be- 
havior of high grading and failing 
college students. Training sessions 
with the checklist improved perform- 
ance of failing students on various ex- 
aminations, although control groups 
were not employed. Bloom and 
Broder’s laboriously developed check- 
list deserves further study; there were 
hints in their work that training with 
the checklist might transfer posi- 
tively to a wide variety of problems. 

None of these studies of amount of 
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training varied some reasonably uni- 
dimensional method of training over 
a wide range. Even in the Fattu and 
Mech and Sato experiments, “‘much” 
training involved a qualitative as 
well as a quantitative change over 
“‘some”’ training. Only in research on 
set can one find a study relating de- 
gree of original learning, systemati- 
cally varied, to amount of transfer. 


Set 


Some situations are problems for 
adult Ss not because of deficiencies in 
S’s intelligence, motivation, or past 
experience, but because S is set to 
respond in certain ways. These sets, 
or momentarily dominant response 
tendencies, can have powerful effects 
in problem solving. Some tasks raise 
problems for human adults only be- 
cause of wrong sets; under other sets 
there is no problem. Perhaps because 
of this, much of the literature on set 
concerns negatively transferring sets. 

Simple sets. Nearly all studies of 
what will here be called simple sets 
have used either water jars problems 
or anagrams. Since a number of 
these studies have been reviewed by 
Chown (1959), only studies not cov- 
ered in her paper will be cited. 

The standard procedure with water 
jars (see Chown) may induce large 
amounts of set; Luchins (1946) re- 
ported that 83% of experimental Ss 
(those given training problems) made 
set responses on the transfer prob- 
lems, whereas only 0.6% of control Ss 
(no training problems) showed set. 
However, the amount of set with 
either water jars or anagrams is influ- 
enced by a number of variables. Set 
was increased by increases in the 
number of training problems (Mayz- 
ner, 1955; van de Geer, 1957), by 
speed instructions (van de Geer), by 
similarity between training and test 
anagrams (Maltzman, Eisman, 
Brooks, & Smith, 1956), and by un- 
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solvable training problems in some 
cases (van der Geer). Studies em- 
ploying other variables that have in- 
creased set are cited by Chown. 

Most of the above findings are 
clear-cut, but some qualifications 
should be noted. Van de Geer’s 
(1957) results were chiefly in the form 
of interactions among his six varia- 
bles. Thus, he found that unsolvable 
training problems increased set only 
in boys and only if extinction prob- 
lems were not given prior to transfer 
problems. Also, increasing the num- 
ber of training problems increased set 
most clearly when extinction prob- 
lems were not given. Rhine (1957) 
found that appropriate set (similar 
training and test anagrams) facili- 
tated test performance only when 
training anagrams were difficult and 
Ss had experienced some failure. 
With easy anagrams and success ex- 
periences, there was no difference be- 
tween groups trained under appropri- 
ate or inappropriate set. 

Set was decreased by extinction 
problems given prior to test problems 
(van de Geer), by increasing the num- 
ber of water jars (in training prob- 
lems, test problems, or both, Bene- 
detti, 1956), and by interpolating 
problems having different solutions 
among the training problems (Mayz- 
ner, 1955; Mayzner & Tresselt, 1956). 
Since distributed practice has been 
found to reduce set (Chown), the 
Mayzner, and Mayzner and Tresselt 
experiments are confounded because 
interpolating problems during train- 
ing necessarily distributes practice on 
training problems. 

The set studies demonstrate spe- 
cific positive or negative transfer 
from one response pattern to another, 
the direction of transfer depending on 
the particular relation between train- 
ing and transfer task. However, it 
would be expected that a series of 
training problems would also produce 
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a nonspecific positively transferring 
effect: learning how to learn (Dun- 
can, 1958; Harlow, 1949), or perhaps 
learning how to solve. Nonspecific 
transfer was demonstrated by Good- 
now and Pettigrew (1956). Groups 
were first trained to respond to spe- 
cific stimulus patterns in a two-choice 
situation, next were given random 
stimulus presentations (presumably 
to extinguish responding to pat- 
terns), and finally were tested for 
learning of new patterns. Such 
groups tended to learn new patterns 
more rapidly than Ss with no prior 
training. The authors believe that Ss 
without prior training have trouble 
because they pay too much attention 
to their own response patterns rather 
than to the stimulus patterns, and 
that this tendency may be a source of 
difficulty in a variety of problems. In 
any case, their results suggest the 
possibly powerful effects of nonspe- 
cific transfer, learning to think or 
learning to solve, in all kinds of prob- 
lems, effects which have been recog- 
nized by only a few writers (Harlow, 
1949; Underwood, 1952; Weaver & 
Madden, 1949). All water jar and 
anagram studies probably included 
some effects of learning to solve, in 
addition to specific positively or 
negatively transferring habits and 
sets. 

With a different type of problem, 
but one which involved set in some 
sense, Lawson, Hillix, and Marx 
(1955), and Hillix, Lawson, and Marx 
(1956) found no effect on transfer 
problems of number of reinforce- 
ments during training, and little ef- 
fect of similarity between training 
and transfer tasks. However, the 
problems (guessing circuits in a 
matrix of lights) differed widely from 
those usually used in set studies, and 
their Ss may have been able to dis- 
criminate fairly well between train- 
ing and transfer tasks. In the usual 
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set study, S has no way of knowing 
which is the first test problem (at 
least until he has solved it), a fact 
which probably tends to increase set. 

Very few investigators have used 
anything other than water jars or 
anagrams to study simple set, so 
practically all information comes 
from two rather similar types of prob- 
lems. Other problems are needed, as 
well as methodological work on water 
jars and anagrams. Frick and Guil- 
ford (1957) do not think that water 
jars induce a set of any considerable 
strength, and agree with Levitt 
(1956) that the problems are not a 
good psychometric or experimental 
instrument. No thorough methodo- 
logical study of anagrams was 
found, although Wiggins (1956), in 
one part of his study, revealed a 
source of uncontrolled variation in 
anagrams with two solutions. He 
scaled such anagrams in terms of the 
frequency with which one or the 
other solution was given by naive Ss 
and found variation over the entire 
range (.50 to .99 probability of occur- 
rence of one of the solutions). Wig- 
gins went on to show that training, in 
the form of brief study of the list of 
words which were the infrequent 
solutions, produced changes from 
giving the frequent to giving the 
infrequent solution. Anagrams in 
which neither solution was especially 
predominant originally were more 
subject to change. This experiment 
suggests that in studies of set, use of 
double-solution test anagrams which 
have, initially, equally likely solu- 
tions would produce a worthwhile re- 
duction in variability. 

It is unfortunate that investigators 
of set almost never presented learning 
curves for either training or test prob- 
lems. Analysis by stage of practice 
can reveal important information. 
For example, instructions to induce 
appropriate set may produce better 
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performance on early, but not on 
late, training problems because set 
(or habit strength) can also be de- 
veloped by solving a series of prob- 
lems of the same class. Learning 
curves for transfer problems would 
reveal the locus as well as the per- 
sistence of transfer effects, e.g., 
groups with inappropriate set might 
show negative transfer on early test 
problems but not on later test prob- 
lems because of learning how to solve. 
Van de Geer (1957) found that train- 
ing conditions had different effects at 
different stages of transfer practice. 

Although there are other factors 
that affect set (e.g., subject variables, 
“see later), the papers already re- 
viewed reveal that quite a lot is 
known about the functional relation- 
ships between a number of independ- 
ent variables and simple problem 
solving sets. At the same time, most 
of the information comes from water 
jars and anagrams, tasks that are 
sometimes held to exemplify only re- 
productive, not productive, thinking 
(Maltzman, 1955). Even if one does 
not (as the reviewer does not) hold 
to this distinction between different 
kinds of thinking or problem solving, 
there is no question that much more 
needs to be known about set in more 
complex problems. Certain difficult 
“insight” tasks, such as the pendu- 
lum solution of the two string prob- 
lem, appear to be problems only be- 
cause the situation evokes strong, 
though labile, response tendencies 
that do not lead to solution. Some 
information about sets in more com- 
plex problems is developed in the sev- 
eral types of experiments reviewed in 
the next section. 

Complex sets: functional fixedness 
and preavailability. All of these 
studies may be described as attempts 
to produce positive or negative trans- 
fer to a problem by procedures in- 
tended to change the order of dom- 


inance either of responses in a hier- 
archy, or of whole hierarchies. 
Duncker’s (1945) work introduced 
a type of complex set called func- 
tional fixedness, which may be de- 
fined as inhibition of use of an object 
in one function due to recent prior 
experience with the object's serving a 
different function. Chown reviews 
most of the functional fixedness stud- 
ies that have appeared since Dunck- 
er’s work. In a more recent study, 
van de Geer predicted that if an ob- 
ject were first used in an unusual 
function, no functional fixedness 
would be found when the object sub- 
sequently had to be used in a usual 
function (the typical order in func- 
tional fixedness studies is usual func- 
tion first, unusual function second). 
A sc.ewdriver and a wrench were 
available to loosen a screwhead bolt, 
or to serve as pendulum weight in 
the two-string problem. Although 
the number of Ss was smail, the re- 
sults appeared to confirm the predic- 
tion. The group that solved the prob- 
lem last tended to avoid the object 
used just previously to loosen the 
bolt, i.e., showed functional fixed- 
ness. But the group that solved the 
problem first did not tend to avoid, 
in loosening the bolt, the object that 
had been used as a weight. 
Functional fixedness is a complex 
set with negative transfer effects 
What are here called preavailability 
studies are attempts to induce com- 
plex sets with positive transfer ef- 
fects. Saugstad (1955) presented, one 
at a time, the various objects neces- 
sary to solve the Maier candle prob- 
lem and had S list all possible func- 
tions for each object; this was called 
an “availability” test. On the test, 
13 out of 57 Ss gave evidence that the 
necessary functions were available, 
i.e., listed functions that would later 
be necessary to solve the problem. 
All of these 13 Ss later solved the 
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problem, whereas Saugstad reported 
that only 58% of those who did not 
indicate that the necessary functions 
were available solved the problem. 
Although the experiment is not im- 
pressive statistically, it does suggest 
that problem solving was influenced 
by the preavailability (set, domin- 
ance) of crucial responses in the hier- 
archy of responses associated with 
each component of the problem. 

Staats (1957) had Ss list uses for a 
screwdriver and other objects, then 
solve the two-string problem with the 
screwdriver as the only object heavy 
enough to serve as pendulum weight. 
Only 7 of the 61 Ss initially indicated 
using a screwdriver as some sort of 
weight, whereas 55 Ss eventually 
solved the problem. Although this 
portion of the experiment was incon- 
clusive, Staats did find low but sig- 
nificant correlations between time to 
solve and frequency and latency of 
weight responses given in a postsolu- 
tion listing of screwdriver uses. He 
believed that these correlations be- 
tween verbal (listed uses) and instru- 
mental (problem solving) response 
hierarchies indicate that problem so- 
lution would have been facilitated if 
weight responses had been elicited 
in sufficient numbers prior to solu- 
tion. 

A different method of manipulat- 
ing preavailability was used by Jud- 
son, Cofer, and Gelfand (1956). 
Their Ss first learned several 5-word 
lists, among which were included 
words, in various numbers and in 
various contexts, relevant to solution 
of the later-presented probiem. 
Thus, rope, swing, and pendulum 
were presumably relevant to the two 
string problem; prop, ceiling, and 


floor were relevant to the Maier hat- 


rack problem. In general, the group 
that learned a list containing all 
three key words was better than 
other experimental and_ control 


groups at producing pendulum solu- 
tions to the string problem, or floor- 
to-ceiling solutions of the hatrack 
problem. Not all of the many dif- 
ferences (there were two replications 
of the string problem experiment) 
were statistically significant, and the 
findings were limited to men. Women 
produced few solutions of the desired 
type to either problem. 

Judson et al. also reported an ex- 
periment showing that reinforcement 
of one response, taken from a previ- 
ously elicited chain of free associa- 
tions, significantly increased the 
probability of occurrence of other 
words in the same chain. Brief men- 
tion was also made of two attempts to 
facilitate solution of the string prob- 
lem by prior elicitation of free associ- 
ations to a list of words, one of which 
was rope. In the first experiment, 
those who had given “swinging” 
associations to rope produced sig- 
nificantly more pendulum solutions 
than those who had not, but these 
results were not confirmed in the 
replication. However, the general 
trend of all their experiments tended 
to support their notion that set and 
direction in problem solving can be 
interpreted in terms of response hier- 
archies which are influenced by char- 
acteristics of the problem and by re- 
inforcement. 

In an oft-cited experiment, Maier 
(1930) claimed to have demonstrated 
that relevant past experience is not 
always sufficient to solve a problem. 
The Ss must also have “direction,” a 
sort of set or connection between 
past experience and present problem 
to enable them to bring the relevant 
past experience to bear. Weaver and 
Madden (1949) and Saugstad (1957) 
repeated the essential parts of Maier’s 
experiment by comparing groups 
given only the relevant past experi- 
ence with groups given past experi- 
ence plus the hint that supposedly 
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serves as direction. Neither found 
that addition of direction increased 
number of solutions. Saugstad also 
experimented with the three part- 
tasks Maier had used to provide rele- 
vant past experience for the test task 
(two-pendulum problem). Solutions 
of the two pendulum problem in- 
creased directly from demonstration 
of the part-tasks (Maier's procedure), 
to solving them as problems them- 
selves, to solving them when one of 
the three was presented in an im- 
proved version. Saugstad held that 
“availability of functions” was all 
that was necessary to solve the prob- 
lem. 

Weaver and Madden pointed out 
that Maier ignored nonspecific past 
experience (learning to learn?) in the 
form of habits of searching and ex- 
ploration, habits which may be 
transferred directly to the present 
problem without the aid of direction. 
Nevertheless, Maier, with his con- 


cept of direction, early called atten- 
tion to the fact that merely having 
relevant past experience is no guar- 
antee that S can bring it to bear to 
solve a problem. This is an issue that 
runs through much research and dis- 
cussion of problem solving in human 


adults; adult Ss “‘know”’ the correct 
responses, but do not have the cor- 
rect set. 
* One other study might be classi- 
fied under preavailability. Kolers 
(1957) used problems requiring ab- 
straction among forms presented on 
a screen. A cue form that would aid 
solution of the problem was flashed 
subliminally just before presentation 
of the problem. The results were un- 
clear in the first experiment, but in 
a second experiment there was some 
evidence that the subliminal cue 
aided problem solving. 

A possible reason why the preavail- 
ability studies did not yield clear-cut 
results is that the various situations 
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were usually needlessly complex, 
even cluttered. (To some extent this 
was also true of the functional fixed- 
ness studies.) For example, S was 
asked to list uses for several irrele- 
vant objects as well as the crucial ob- 
ject, or was asked to solve the prob- 
lem in the presence of several irrele- 
vant objects. No necessary purpose 
seems to be served by these or other 
ways of complicating the situation. 
Moreover, such complex situations 
probably generate a potpourri of 
positive and negative sets which are 
difficult to analyze and which may 
increase variability. Judson et al. 
(1956) seemed to be implying this 
criticism of overcomplexity when 
they suggested that their experiments 
may have been overcontrolled. 

In spite of the sometimes ambigu- 
ous results, and in spite of the small 
number of published studies, func- 
tional fixedness and preavailability 
experiments represent, in the re- 
viewer's opinion, one of the most 
fruitful types of research on problem 
solving. If one takes the position 
that for human adults many prob- 
lems are such that they demand re- 
sponses which are low in a hierarchy, 
then’ functional fixedness and pre- 
availability studies are seen as direct 
attempts to manipulate such hier- 
archies. With more detailed analysis 
of problem situations, and with re- 
finement of methods, such studies 
could contribute much to knowledge 
of the antecedent conditions of prob- 
lem solution. 

Order effects. For present pur- 
poses, experiments in which an at- 
tempt was made to influence prob- 
lem solution by varying the chrono- 
logical order of certain experiences 
are classed as studies of set. Some 
papers already reviewed dealt in part 
with effects due to order of experi- 
ence (Kendler & Kendler, 1956; 
Maltzman, Eisman, & Brooks, 1956; 
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Székely, 1950b; van de Geer, 1957). 

Stolurow, Hodgson, and Silva 
(1956) found negative transfer in air- 
plane mechanics from both orders of 
presentation of school training and 
brief job experience. Herman and 
Engstrand (1957) devised two classes 
of problems, one depending on posi- 
tion of letters on cards, the other de- 
pending on relationships in the alpha- 
bet. The results showed: positive 
transfer between problems of the 
same class, zero transfer from posi- 
tion to alphabet problems, negative 
transfer from alphabet to position 
problems. 

Swartz (1955) did not find any ef- 
fect on solution of a problem devised 
from playing cards by prior sorting 
of the cards into suits. 

It was pointed out earlier that dif- 
ferential, and unknown, transfer ef- 
fects may have been operating in a 
number of problem solving experi- 
ments. The research on order of 
presentation suggests that in design- 
ing studies of problem solving, one 
should not ignore the possibility that 
the experimental design may permit, 
even reinforce, differential transfer 
effects. 

In over-all view, this major sec- 
tion on training and transfer in prob- 
lem solving appears as follows. In the 
case of problems which depend on 
simple sets, the effective training 
variables were largely the same vari- 
ables, operating in much the same 
way, that influence transfer per- 
formance in other learning situations. 
No such summary statement can be 
made about the antecedent variables 
for any other types of problems, al- 
though a few kinds of complex sets 
had some effect. In part, research on 
complex problems has yielded con- 
flicting results; more importantly, 
too little research has been done. 
Furthermore, experiments on com- 
plex problem solving are mostly of the 
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simple two-group type; studies in 
which even one variable was system- 
atically manipulated over a wide 
range are almost nonexistent. Sys- 
tematic variation cannot, of course, 
be undertaken until variables are 
identified and dimensionalized, but 
little analytic work of this kind has 
been done in research on complex 
problem solving. 


VARIATION WITHIN THE PROBLEM 


In this group of studies, either con- 
ditions concurrent with the problem, 
or characteristics of the problem it- 
self, were varied. The experiments 
are extremely heterogeneous.  Be- 
cause of this, no good defense can be 
offered for the subcategories used. 


Methods of Presenting the Problem 
This category includes studies in 
which the same problem was pre- 
sented in different modes or appear- 
ances. The different modes were usu- 
ally, but not always, isomorphic to 
each other in the sense that relation- 
ships among the elements of the 
problem remained the same. 
Concreteness. Many problems can 
be presented in either symbolic or 
concrete (real) form, in various de- 
grees of these extremes, in miniature 
scale models of the real presentation, 
etc. Also, degree of overtness of S’s 
behavior, insofar as it is under the 
investigator's control, has been used 
as a method of varying concreteness. 
The following studies found no ef- 
fect of varying concreteness of the 
problem, or in some cases, of con- 
creteness of S's behavior: Saugstad 
(1957) with a miniature scale model 
vs. the real presentation of the two 
pendulum problem; or Lorge, Tuck- 
man, Aikman, Spiegel, and Moss 
(1955a, 1955b), who used the mined 
road problem at seven “levels of 
reality”’ (verbal, photographic, mini- 
ature scale model, or real presenta- 
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tion, or various amounts of manipula- 
tion of the scale and real versions). 
In Saugstad’s repetition of the Maier 
experiment, the “‘direction’’ was a 
hint that was supposed to call atten- 
tion to the ceiling. Saugstad thought 
that a miniature scale model, of the 
actual hallway in which the two- 
pendulum problem usually must be 
constructed, would call more atten- 
tion to the ceiling, but neither num- 
ber of solutions nor behavior of fail- 
ing Ss gave any indication that the 
ceiling was a special source of diff- 
culty 

In contrast to the preceding studies, 
Cobb and Brenneise (1952) and Gibb 
(1956), found rather clear-cut ef- 
fects by varying concreteness. Cobb 
and Brenneise reported that anchor, 
reach, and extension solutions of the 
two-string problem decreased as con- 
creteness decreased over four steps. 
Pendulum solutions were little af- 
fected but were few enough so that 
for all types of solutions combined, 


percentage solutions were perfectly 
correlated with increasing concrete- 
ness. Gibb used three types of sub- 
traction problems presented in three 


degrees of concreteness to second 
grade children. Both main variables 
were significant on most measures, 
and did not interact. If children are 
more affected by concreteness than 
are adults, Gibb’s results would not 
necessarily conflict with the studies 
reporting no effects of concreteness. 
But there is no obvious way of ac- 
counting for Cobb and Brenneise’s 
positive results. They did use what 
is probably more of an “insight” 
problem than did the studies report- 
ing negative results, but it was only 
the insight (pendulum) solution that 
not affected by concreteness. 
Their least concrete mode of presen- 
tation seems qualitatively different 
from the other three modes, but this 


was 
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would not account for all their re- 
sults. 

Distribution of work and rest. Pe- 
riods of work and rest on a problem 
can be varied in a number of ways. 
Riley (1952) found no clear-cut dif- 
ference between intertrial rests of 8 
sec. vs. 2 min. during learning of a 
rote list that required S to discover, 
to varying degrees, the response term 
for each stimulus. He noted that if 
anything, his results were the re- 
verse of the hypothesis that massing 
of practice should produce better 
performance early in learning be- 
cause it should facilitate discovery, 
whereas distribution should be better 
later in learning because it should 
facilitate fixation (Underwood, 1949, 
reviews the older studies from which 
this hypothesis was developed). 

Distribution of practice had clear- 
cut effects in Shaklee and Jones’ 
(1953) experiment when work and 
rest cycles were varied prior to solu- 
tion of a kind of matching-by-infer- 
ence problem. Groups worked under 
continuous practice, under cycles of 1 
min. work-30 sec. rest, or cycles of 1 
min. work-4 min. rest. In a second 
experiment the latter cycle was 
changed to 1 min.-90 sec. cycles. In 
both experiments, the first and third 
cycles, i.e., continuous practice and 
the quite distributed cycle, did not 
differ in terms of percentage of solu- 
tions, but both were significantly su- 
perior of the 1 min.-3 sec. cycle. This 
U-shaped function between correct 
solutions and distribution did not oc- 
cur with incorrect solutions, which in- 
creased directly with distribution of 
practice. 

It is rather clear that distribution 
of practice in problem solving needs 
further study. 

Other methods. Each of the follow- 
ing studies used a unique method of 
presentation. Katz (1949, experi- 
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ment more briefly reported in 1950) 
had adult Ss give sums based on the 
numbers 1-9; with children, the 
numbers were 1-5. Each number 
was printed on a card. The cards 
were presented in what might be 
called ‘‘degrees of disorder,” e.g., 
cards were presented in order in a 
column, in an unordered column, 
after being shaken in a box, etc. Time 
to give sums, and errors, increased 
directly with increasing disorder of 
presentation, both in children and in 
adults. 

The calculus of propositions tasks 
(see Moore & Anderson, 1954b) were 
presented by Anderson (1957) as if 
they had from 1-4 goals or solutions, 
when in fact there was only one goal. 
The number of Ss achieving the goal 
decreased directly as number of 
stated goals increased. This result 
may be roughly similar to one which 
apparently occurs with the two-string 
problem. Instructions to find as 
many solutions as possible, vs. in- 
sistence on the pendulum solution 
only, seem to elicit anchor, reach, and 
extension, at the expense of pendu- 
lum, solutions. 

Two other studies found no effects 
of different methods of presenting the 
problem. Fattu and Mech (1953b) 
reported no differences attributable 
to interrupting Ss at various stages 
of work on gear train problems and 
asking them to state verbally where 
the malfunction was. Hafner (1957) 
found no effect in fourth grade chil- 
dren of instructions to verbalize 
while working on a stencil design 
problem. 


In general, degree of concreteness 
has had little effect on problem per- 
formance in adults, except in Cobb 
and Brenneise’s (1952) experiment. 
Studies using other methods of vary- 
ing presentation are too few or too 
dissimilar to summarize. 
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Variation among Elements 
Problem 

These studies are also, in a way, 
methods of varying presentation of 
the problem, but in this case there 
was usually a real change in the prob- 
lem itself, e.g., a change in the num- 
ber, order, or kind of problem ele- 
ments. Perhaps the Katz (1949) and 
Anderson (1957) ‘experiments could 
just as well have been included here, 
as well as some experiments on 
simple sets. In the latter, it has been 
found that interpolation of various 
conditions among the test problems 
will reduce set, e.g., extinction prob- 
lems (van de Geer), additional jars 
(Benedetti). 

In Judson and Cofer’s (1956) ex- 
periment Ss had to select the word 
that was out of place in groups of 
words, each group containing two 
ambiguous and two unambiguous 
words. The Ss clearly chose on the 
basis of the first-appearing unambig- 
uous word; in the authors’ terms, 
“priority of activation of a response 
hierarchy” significantly influenced 
behavior. Increasing the number of 
ambiguous words between the two 
unambiguous words increased the 
dominance of the first-occurring un- 
ambiguous word. 

Surprisingly strong effects of spa- 
tial contiguity among elements of a 
problem were reported by Kay 
(1954). The Ss had to turn off a row 
of lights three feet away from a row 
of switches, using as a cue numbers 
printed in a random arrangement on 
a card. When a light came on, S as- 
signed it a number from 1 to 12 (left 
to right), ‘located the number on the 
card, and pressed the switch in line 
with the number. Time and error 
scores increased directly as the card 
was first placed directly in front of 
the switches, then moved to midway 
between, finally placed directly in 
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front of the lights. A few Ss, espe- 
cially older ones, could not do the 
task at all if the card was anywhere 
beyond the midpoint, i.e., closer to 
the lights than to the switches. The 
effects of contiguity might not have 
been so great if Ss had performed the 
most difficult task (card directly in 
front of lights) first, then transferred 
to the easier tasks. Nevertheless, the 
results clearly suggest that intraprob- 
lem contiguity is of fundamental im- 
portance in problem solving. It is 
possible that contiguity among the 
elements of a concrete problem heav- 
ily determines the degree to which 
such processes as reordering and re- 
structuring (Wertheimer, 1945) can 
occur. 

Se'ley (1957) made use of the fact 
tu.c the meanings of small, white, 
light, and up tend to be positively 
correlated, and opposite to large, 
black, heavy, and down. Different 
sets of discs (boxes), each incorporat- 
ing one of these dimensions, were 
used in the disc transfer problem. In 
six trials through the problem there 
were fewer errors when boxes had to 
be moved in the normal light-to- 
heavy direction than in its opposite. 
The ordinary size-cue expectancy, 
small-to-large, produced fewer extra 
moves and shorter time than its op- 
posite. Other comparisons were not 
significant. 

In Cobb and Brenneise’s experi- 
ment, anchor, reach, and extension 
solutions of the two-string problem 
decreased, pendulum solutions in- 
creased, when the _ investigators 
changed the group of objects ordi- 
narily available to an alternative 
group that was more relevant to 
pendulum solutions. 

Studies of behavioral processes in 
problem solving (see later’ sometimes 
also report changes in performance 
due to variation among problem ele- 
ments. Battig (1957) had Ss guess 
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the letters of a word with foreknowl- 
edge only of the number of letters in 
the word. The particular words used 
were a major source of variance; both 
length and frequency of usage of the 
words were complexly related to the 
several response measures. Hunter 
(1957) used different ways of stating 
problems of the type: A is greater 
than B, C is greater than B, which is 
greatest. There were differences due 
to the ways of stating the problems, 
to atmosphere effects, and to type of 
relation used (happier-sadder, taller- 
shorter, etc.). Hunter's study has 
some similarity to earlier research 
(not reviewed here) on atmosphere 
and order effects in syllogistic rea- 
soning. 

In contrast to the experiments on 
methods of problem presentation, 
studies of variation among problem 
elements consistently reported at 
least some significant effects, occa- 
sionally powerful effects, on problem 
solving performance. ‘Thus, per- 
formance on a problem may or may 
not be influenced by contextual vari- 
ables, such as methods of presenta- 
tion that do not change relationships 
among elements of a problem. But 
changes of a problem's internal struc- 
ture usually influence performance, 
even in cases where the problem re- 
mains, in some physical sense, the 
same. 


Difficulty 


All variables that significantly af- 
fect speed or frequency of Solution 
could be said to influence the diffi- 
culty of a problem. The studies to be 
reviewed here are those in which 
some condition intended to influence 
difficulty was deliberately varied. 

All experiments on methods of 
“understanding” (Corman, 1957; 
Crannell, 1956; Forgus & Schwartz, 
1957; Hilgard et al., 1953; Hilgard et 
al., 1954) used several transfer tasks, 
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some called simple, others difficult. 
In some cases, but not.always, it ap- 
peared that different training meth- 
ods produced differences only on dif- 
ficult transfer problems. 
Performance is rather clearly af- 
fected by deliberate increases in 
problem difficulty. Within limits, 
problem difficulty is increased by 
increasing: the number of stimulus 
items with number of response items 
held constant (Brush, 1956), the 
number of stimulus-response or total 
items (Brush, 1956; French, 1954), or 
the response availability, defined as 
the number of response items from 
which the correct response for each 
stimulus must be selected (Brush, 
1956; Noble, 1955; Noble, 1957; 


Riley, 1952). These studies make an 
important contribution to knowledge 
of S-R relationships in problem solv- 
ing; in particular, the response avail- 
ability experiments represent direct 
attacks on the important dimension 


of response discovery. The most ex- 
tensive work on response availability 
is that by Noble. He showed that 
with four stimuli, difficulty increased 
directly as number of available re- 
sponses per stimulus increased from 
4 to 10, but that there was relatively 
little further increase in difficulty 
from 10 to 14 alternatives. 

Ling (1946) and John (1957) give 
detailed protocols of changes in be- 
havioral processes that occur when 
problem difficulty is increased. Ling 
used Kéhler-type tool problems of 
increasing difficulty with young chil- 
dren. John developed a complex de- 
vice called the PSI (Problem-Solving 
and Information) Apparatus (see 
also John & Miller, 1957) which was 
used with two levels of difficulty with 
adults. In the Goldbeck et al. (1957) 
study of the half-split technique, use 
of different levels of difficulty on 
their apparatus revealed that the 
technique was of no value on the 
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more difficult problems until Ss were 
first given training on deductive 
skills. 

As might be expected, performance 
usually varied as a function of prob- 
lem difficulty. Noble’s work shows 
that the function is not necessarily 
linear. 


Hints and Aids 


Various hints, aids, or instructions, 
given S just before or during work on 
a problem, have been used to facili- 
tate solution. Maltzman, Eisman, 
Brooks, and Smith (1956) found that 
instructions influenced solution of 
test anagrams regardless of the class 
of training anagrams or the type of 
instructions that S had been given 
for training anagrams. In one of 
Burack and Moos’ (1956) experi- 
ments, three increasingly-concrete 
hints concerning the principle of cen- 
trifugal force were given one at a 
time at 2-min. intervals while S 
worked on the mechanical puzzle. 
After all hints had been given, five 
of the eight Ss had managed to solve 
the problem. 

Experiments in which aids of vari- 
ous kinds were of more primary con- 
cern have been reported by Reid 
(1951) and Marks (1951). Reid’s 
study was based on Duncker’s (1945) 
notion of ‘‘explication of the goal.” 
Several experiments were done on 
two problems: make triangles out of 
matches, and fit together pieces of 
wood to form a tetrahedron. Experi- 
mental grou; received hints at regu- 
lar intervals while working, each suc- 
cessive hint making the goal increas- 
ingly more explicit. The hints for 
control groups were not intended to 
explicate the goal. In general, each 
successive hint to experimental Ss 
increased the number of Ss solving 
the problem; eventually, — signifi- 
cantly more experimental Ss solved 
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in all experiments and on both prob- 
lems. 

Marks’ Ss tried to locate errors in 
square root problems. Tw» kinds of 
hints were used, a list of possible 
sources of error, or E’s urging S, at 
intervals, to ask himself where a 
mistake could occur. As Marks pre- 
dicted, verbal urging increased both 
S's vocalizations (naming or pointing 
to problem elements), and the num- 
ber of solutions, but contrary to pre- 
diction, the list had no effect on 
number of solutions. Verbal urging 
yielded tetrachoric correlations of 
.94 with vocalizations, .82 with solu- 
tions. 

All of the studies on aids found that 
at least some kind of aid was effec- 
tive, sometimes very effective. It is 
curious, then, to find an occasional 
study reporting that Ss were aided if 
necessary, but not reporting how 
many Ss were aided or if aid had any 
effect. 

In summary of this major section 
on variation of conditions during the 
solving of a problem, it may be noted 
that almost all variables studied have 
influenced performance. The major 
exception is the class of diverse pro- 
cedures called methods of problem 
presentation which, except perhaps 
for concreteness, yielded either con- 
flicting results or too few results with 
any one method to warrant a conclu- 
sion. 

The studies reviewed in this sec- 
tion illustrate a weakness that runs 
through the whole area of problem 
solving research, viz., the heterogene- 
ity of problems and techniques em- 
ployed. About 100 empirical studies, 
several of them including more than 
one experiment, are covered in this 
review. In nearly half of these studies 
the problem used was devised by the 
authors and has not yet been used by 
anyone else; even a brief description 
of each of these problems would have 
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added materially to the length of this 
paper. This diversity is a major rea- 
son why the area of problem solving 
seems so chaotic, and is a serious 
obstacle to systematic progress. A 
few authors stated the advantages 
that their new problems were pre- 
sumed to have, occasionally in sep- 
arate publications (Marx, Goldbeck, 
& Bernstein, 1956; Moore & Ander- 
son, 1954b), but most did not. Prob- 
lem solving research would be im- 
proved if more efforts were made to 
meet the standards for problems set 
by Ray 1955). 


SUBJECT VARIABLES 


Although a few experiments on 
problem solving have been expressly 
designed to test for effects of various 
characteristics of human Ss, most 
papers report effects of such variables 
as by-products. Therefore, most of 
the studies to be reviewed here have 
been cited earlier and will be only 
briefly described. 


Sex Differences 


Not infrequently, men have been 
found to be better problem solvers 
than women, but close examination 
of the literature reveals some qualifi- 
cations of this finding. 

Van de Geer (1957) reported two 
experiments showing that 12-yr.-old 
girls were both more susceptible to 
set and less able to surmount set than 
were boys of the same age. Van de 
Geer also showed that girls developed 
no more set from two training prob- 
lems than did boys, but that with six 
training problems girls developed so 
much set that unsolvable problems 
or speed instructions did not further 
increase their set. Rhine (1957) re- 
ported no sex difference in set on test 
anagrams. 

Men produced more pendulum so- 
lutions of the two-string problem 
(Cobb & Brenneise, 1952; Judson, 
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Cofer, & Gelfand, 1956; Staats, 
1957), but not more anchor, reach, or 
extension solutions (Cobb & Bren- 
neise). Staats suggested that women 
may have had more trouble with the 
pendulum solution because he found 
that they gave significantly more 
“hammer” uses for the available 
pendulum weight (screwdriver) than 
men. In Cobb and Brenneise’s ex- 
periment, men produced more total 
solutions than women under concrete 
methods of presenting the problem, 
but not under more abstract methods. 
Judson, Cofer, and Gelfand also 
found that women produced fewer 
floor-to-ceiling solutions of the hat- 
rack problem than men, and were 
not differentially affected by the vari- 
ous preavailability conditions. 

In studies with other complex 
problems, sex differences have been 
mentioned occasionally. Hilgard et 
al. (1954) found high school boys su- 
perior to girls on Katona card prob- 
lems. Saugstad (1952) used five 
complex problems to test his hy- 
pothesis that incidental memory 
should correlate negatively with abil- 
ity to solve such difficult problems. 
Significant negative correlations were 
found for boys but not for girls. 

McNemar (1955) found men sig- 
nificantly superior on a battery of 
reasoning items selected from some 
of Guilford’s tests, but Staats (1957) 
found no sex difference on the Ab- 
stract Reasoning Test of the Differ- 
ential Aptitudes battery. Moraes 
(1954) tested school children at sev- 
eral age levels on arithmetic reason- 
ing; boys were slightly superior to 
girls, but significantly so at only one 
age level. Engelhard (1955) also 
found few sex differences when boys 
and girls were tested on a variety of 
instruments dealing mostly 
arithmetic problem solving. 

Perhaps the best study on sex dif- 
ferences is that by Milton (1957). 
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She used 20 brief problems, 10 of 
which were said to require restructur- 
ing or altering initial set, 10 of which 
involved straightforward . solution. 
Men were significantly superior on all 
measures. However, score on the 
Terman-Miles M-F Scale, and the 
combined score from two other M-F 
scales, both correlated significantly 
with problem solving score. When 
M-F scores were partialled out in a 
covariance analysis, sex was not a 
significant variable on problem score. 
Furthermore, Terman-Miles_ scores 
contributed significant beta weights 
to problem scores within each sex. 
Thus, Milton argues that sex-role 
identification, learning of which be- 
gins in childhood, is an important 
variable in problem solving skills. 
This study, and Staats’ (1957) work 
on sex differences in response hier- 
archies, are important contributions 
to the issue of sex differences in prob- 
lem solving; both suggest that such 
differences as do occur result from dif- 
ferential past experience. 


Age Differences 


Age is usually an effective variable 
in most types of problem solving. 
Some of the age studies have been re- 
viewed by Chown (1959). In other 
studies, Sato (1953) found that chil- 
dren were more affected by amount of 
training than by difficulty level of 
the problems, whereas the reverse 
was true for adults. Katz (1949) 
found both children and adtilts to be 
hindered by his various methods of 
presenting the problem, but differen- 
tial effects of age could not be meas- 
ured since the children had been 
given easier problems. Hunter (1957) 
reported that 16-yr.-olds did better 
than 11-yr.-olds on his syllogistic-like 
problems. Moraes (1954) gives de- 
tailed protocols of patterns of think- 
ing demonstrated by school children 
of different ages as they worked on 
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arithmetic reasoning problems. 

In this group of studies, Ss have 
been differentiated mainly on the 
basis of chronological age. Nosystem- 
atic attempts have been made to re- 
late differences associated with chron- 
ological age to other variables, e.g., 
mental age (but see John, 1957). 


Reasoning Ability 


Scores on the Abstract Reasoning 
Test of the Differential Aptitudes 
battery were compared with several 
other performance measures by Staats 
(1957). The test correlated —.33 
(P <.01) with log time to solve the 
two-string problem, but showed near- 
zero correlations with various pre- 
availability scores derived from listed 
uses for the screwdriver. In Maltz- 
man, Eisman, and Brooks’ (1956) 
study with the two-spheres problem, 
Ss above the median on the Abstract 
Reasoning Test produced more solu- 
tions than did Ss below the median. 

The most extensive study relating 
reasoning ability to other measures of 
problem solving was done by McNe- 
mar (1955). A battery of four types of 
reasoning items, not correlated highly 
with intelligence, was used to select 
a group of high and a group of low 
reasoners from a large group of 
students. These groups were com- 
pared on free and controlled associa- 
tion tests (fluency), on induction and 
deduction problems (ability to bring 
past experience to bear), and on 
water jar problems (variability). 
Highs and lows were, as predicted, 
not different on free association, but 
highs produced increasingly more 
words as association became increas- 
ingly controlled. Highs were better 
in accuracy and speed on the induc- 
tion problem, but better only in ac- 
curacy on the deduction problem. 
McNemar found rather varied re- 
sults when Ss were questioned about 
their use of various methods of attack 
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on problems; however, the data did 
suggest that highs were better at ‘‘se- 
lecting’’ among relevant and irrele- 
vant aspects of past experience. On 
water jar problems, she found no dif- 
ference between highs and lows on the 
five training problems, but taking ac- 
count of a problem found to have two 
solutions, highs solved more than 
lows, as they also did on all problems 
(including two criticals, one extinc- 
tion) combined. Highs and lows did 
not differ in susceptibility to set, but 
highs were considerably better able 
to surmount set. 

Although the studies are few, the 
results are fairly consistent. Reason- 
ing, as measured by various tests, has 
been found to be related to most 
measures of problem solving perform- 
ance. 


Motivational Variables 


Scores on the Taylor Anxiety Scale 
have been related to performance on 
a few problem solving tasks. In the 
case of simple set problems, high 
anxiety has usually been found to 
produce stronger set (Chown). 
Mayzner and Tresselt (1956) did not 
get a clear-cut relation between anxi- 
ety and set in one experiment, but in 
a second experiment the low-anxious 
group produced significantly more 
direct solutions on test problems. 

Staats (1957) found that Taylor 
Anxiety scores correlated only .11 
with log time to solve the two-string 
problem. However, a comparison of 
high- and low-anxious groups would 
have been advisable since such 
groups may differ significantly in 
performance even though the over-all 
correlation between anxiety and per- 
formance is low. At the same time, 
Staats’ finding of no particular rela- 
tionship between anxiety and per- 
formance on a complex problem 
agrees with the results of Maltzman, 
Eisman, and Brooks, who found no 
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relationship between anxiety and 
performance on the two-spheres prob- 
lem. These authors also found no re- 
lationship between performance on 
the spheres problem and a test that 
presumably measured neuroticism. 
Findings with other variables here 
classed as motivational were reported 
by Rhine (1957) and Judson and 
Cofer (1956). Rhine found no rela- 
tion between anagram solving and 
scores on McClelland’s m Achieve- 
ment Test. In some of Judson and 
Cofer’s sets of four words, one of the 
two unambiguous words was a “‘re- 
ligious’’ word, the other was not. 
Frequency of church attendance 
among Ss was significantly related to 
frequency of exclusion of the nonre- 
ligious word on some of the items, 
There seems to be some relation- 
ship between scores on the Taylor 
Anxiety Scale and performance on 
simple set problems. Research on 
other motivational variables is too 
sparse to permit generalizations, al- 
though this review does not cover 
the literature in which problem solv- 
ing tasks were used to study person- 
ality (see Chown, 1959; Levitt, 1956), 
nor studies of the effects of social 
attitudes on syllogistic reasoning. 


Other Individual Difference Variables 


Other subject variables that have 
been employed in the study of prob- 
lem solving will be mentioned briefly. 
Koyanagi (1953) and Corman (1957) 
compared groups of high and low 
mental ability. Koyanagi’s bright 
children learned to cover a hole in a 
path so that a ball they were rolling 
along the path would not drop 
through. The dull children’s set for 
rolling the ball seemed to prevent 
their learning the anticipatory re- 
sponse of covering the hole. Cor- 
man’s brighter high school students 
benefited from large amounts of in- 
formation on how to attack Katona 
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match problems, but less bright stu- 
dents were able to utilize only small 
amounts of such guidance. This dif- 
ference was even more apparent when 
information about method and about 
rule were combined; with limited 
time on each problem, less able stu- 
dents could not integrate and use 
large amounts of information or guid- 
ance. 

Differences associated with S’s an- 
alytic habits have been noted by a 
few investigators. Behrens and Miles 
(1957) reported that trained observ- 
ers were consistently able to categor- 
ize Ss as analyzers or nonanalyzers on 
the basis of Ss’ verbal statements 
concerning their approach to block 
design problems. Classification as an- 
alyzer or nonanalyzer correlated .77 
in one group, .84 in a second group, 
with time to solve the problem. 
Bloom and Broder (1950) empha- 
sized analysis of the problem as part 
of their training for problem solving 
because of differences in analytic ap- 
proach which they had noted in com- 
paring good and poor problem solv- 
ers. In various indirect ways, : me 
of the studies of problem solving 
processes to be reviewed later sug- 
gest the advantage of analytic habits. 
Compare also Hilgard’s et al. (1953, 
1954) repeated emphasis on errors 
due to attitudes of carelessness (al- 
though this may be a motivational 
difference), and John’s (1957) de- 
scription of the differences in ap- 
proach to problems shown by Ss 
trained in the natural sciences vs. 
those trained in other disciplines. 

In Forgus’ (1957) study, five 
groups of Ss were differentiated in 
terms of degree of set on a long series 
of water jar problems. These groups 
were then given three tests designed 
to measure variability of response. 
As predicted, the groups did not dif- 
fer on mere number of sensibie re- 
sponses, but degree of set was in- 
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versely related, in two tests, to num- 
ber of different principles used among 
responses. In other words, set was 
not related to mere variability of be- 
havior, but “discriminatory varia- 
bility”’ decreased as set increased. 

Subject differences are sometimes 
studied by comparing groups of Ss 
identified as good or poor problem 
solvers. Fattu, Mech, and Kapos 
(1954) differentiated such groups on 
pretest gear train problems, then 
gave both groups two kinds of train- 
ing lectures and followed each lecture 
with a test on additional problems. 
The good group remained better 
problem solvers even on the last test. 
The greatest progressive improve- 
ment over tests was shown by some, 
but not all, of the poor group. The 
good group was much better on 
a magnitude-or-error measure, and 
showed less stereotypy. The patterns 
of search behavior shown by the poor 
group became increasingly similar 
to the patterns exhibited by the 
good group, but this improvement 
was not accompanied by much in- 
crease in number of problems solved. 
Fattu et al. criticized time as a meas- 
ure in problem solving because there 
were no differences in time scores for 
groups, tests, problems passed, or 
problems failed. 

Fattu’s et al. finding that initial 
differences between good and poor 
problem solvers were reduced but 
were not eliminated by training, is 
an important result, and one which 
has been found, directly or indirectly, 
in a number of other studies, e.g., 
Battig (1957), Bloom and Broder 
(1950), and John (1957). In Battig’s 
word-formation problems, high scor- 
ing Ss were shown to have more ap- 
propriate letter preferences (a power- 
ful variable in these problems) than 
low scoring Ss, who were more bound 
by alphabet order. High Ss showed 
better search patterns; they exhibited 
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response consistency at the beginning 
of a problem, variability later. In 
contrast to Fattu et al., Battig’s 
groups were differentiated by time 
scores; high Ss took more time per 
guess. 

Other studies reporting various 
comparisons between groups differen- 
tiated as good or poor are: Bloom and 
Broder (1950); Engelhard (1955); 
Goldbeck et al. (1957); Hillix et al. 
(1956); Kliebhan (1955); Lawson et 
al. (1955); Moraes (1954), and 
Székely (1950b). Bloom and Broder, 
and Moraes made extensive com- 
parisons of differences in problem 
solving processes between good and 
poor groups. Engelhard and Klieb- 
han selected high and low groups of 
girls and of boys on both an intelli- 
gence test and an arithmetic problem 
solving test. The highs were signifi- 
cantly better on 15 other tests. Gold- 
beck et al. found that the half-split 
technique was of more help to high 
ability Ss. If, in Székely’s study, one 
ignores the differences attributed to 
different methods of training (which 
were not confirmed by Maltzman, 
Eisman, & Brooks, 1956), the results 
are even more clear-cut; those who 
understood the principle underlying 
an earlier, different problem (reported 
in Székely, 1950a) produced more 
solutions of the two spheres problem. 
Lawson et al. found that in choice of 
alternative solutions on the test prob- 
lems, slow learners were more af- 
fected than fast learners by set in- 
duced by the training problems, but 
in the Hillix et al. experiment, where 
the test problem was related to the 
training problem in several ways, fast 
and slow learners did not differ in 
choice of solution. 

Research employing only or pri- 
marily correlational! techniques, such 
as psychometric and factor analytic 
studies, is not reviewed here. How- 
ever, several studies cited elsewhere 
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in this paper also included compari- 
sons, usually via correlation, among 
subject variables, among different re- 
sponse measures, among response 
measures and various standardized 
tests, etc. A few of these comparisons 
were mentioned earlier; others, which 
are too varied to be summarized here, 
appear in Battig (1957); Frick and 
Guilford (1957); Maltzman, Eisman, 
and Brooks (1956); Marks (1951); 
McNemar (1955); Saugstad (1952); 
and Staats (1957). 

In summary of the several cate- 
gories of subject variables reviewed 
above, it may be said that nearly all 
such variables for which at least a 
few references are available, have had 
some influence on problem solving 
performance (see also studies of sub- 
ject variables reviewed by Chown). 
Furthermore, the effects of some of 
these variables were not always lim- 
ited to a particular kind of problem; 
their effects tended to be somewhat 
general. At the same time, research 
in this area often comes out with de- 
tailed findings that are difficult to re- 
late either to each other or to the find- 
ings of other studies. Research 
modeled after the studies of Fattu, 
Mech, and Kapos (1954), McNemar 
(1955), or Milton (1957), or perhaps 
experiments based on Guilford’s 
(1956) factors, would yield more 
systematic knowledge. 


INDIVIDUAL Vs. GRrouP PROBLEM 
SOLVING 


Several carefully done experiments 
in the recent literature bear on the 
question of whether groups solve 
problems better than do individuals. 
Taylor and Faust (1952), and Lorge, 
Tuckman, Aikman, Spiegel, and 
Moss (1955a, 1955b, 1956) found 
groups superior on at least some re- 
sponse measures. Taylor and Faust 
compared individuals, groups of two, 
and groups of four, on the game of 
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“Twenty Questions.” All Ss were 
instructed that number of questions 
was the important score. On number 
of questions and on time, twos and 
fours did not differ, but both were sig- 
nificantly better than individuals. 
Failures decreased directly from indi- 
viduals to twos to fours (all differ- 
ences significant). On an efficiency 
measure (man-minutes: number of 
persons X time), individuals were bet- 
ter than twos, twos were better than 
fours. The authors could also have 
shown that individuals took by no 
means twice as miany questions as 
twos or four times as many as fours. 
Practice effects over days did not dif- 
fer as a function of the three condi- 
tions. 

The first two stucies by Lorge et 
al. (1955a, 1955b) were cited earlier 
in connection with their seven meth- 
ods of presentation of the mined road 
problem. Individuals were compared 
with groups of five under all methods. 
For scoring, a content analysis was 
made of S’s written solutions and 
crucial aspects of the solution were 
weighted. There were highly signifi- 
cant differences in favor of groups 
over individuals on this ‘‘quality-of- 
solution”” measure under all methods 
of presentation, with no interaction. 
Groups asked more questions than 
individuals, suggesting that group 
superiority was in part due to ob- 
taining more information. In an- 
other experiment (Lorge et al., 1956), 
only the real presentation of the prob- 
lem was used. Group superiority was 
again evident. It was also shown 
that in their written reports, groups 
tended to underestimate the quality 
of their actual solutions (as measured 
by reliable observers). In part, indi- 
viduals tended to overestimate their 
solutions. In these several experi- 
ments, Lorge et al. did not report an 
efficiency measure; groups were bet- 
ter in over-all quality of solution, but 
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were almost certainly not five times 
better either in over-all quality or in 
any one component of solution. 

McCurdy and Lambert (1952), 
Moore and Anderson (1954a), Mar- 
quart (1955), and, perhaps, Comrey 
and Staats (1955), found no evidence 
for group superiority. The McCurdy 
and Lambert problem required turn- 
ing six switches. Working individu- 
ally, S turned all switches; working 
in groups of three, each S turned two 
switches. Groups were no better than 
individuals, and leaderless groups 
were no different from groups in 
which one S gave directions that the 
others had to follow. 

Moore and Anderson first matched 
groups of three Ss with individual Ss 
on knowledge of the calculus of prop- 
ositions tasks. Over a 10-day period 
of solving problems, individuals did 
not differ significantly from groups 
on: number of problems solved, 
mean steps taken on problems, mean 
time on solved problems, mean er- 
rors, or on two measures of repeti- 
tiousness of response. On a man- 
hour basis, individuals were almost 
three times as efficient as groups. 
Moore and Anderson had forced 
groups to agree on steps in solving, so 
one member would not dominate; 
thus, they noted that groups had to 
work as groups, a responsibility not 
saddled onto individuals. 

Marquart (1955) repeated and ex- 
panded the oft-cited Shaw (1932) 
study, using eight problems cf vari- 
ous kinds. All Ss worked on all prob- 
lems, both as individuals and ‘as 
members of groups of three. By 
Shaw’s method, involving compari- 
son of total solutions to total possible 
solutions, groups were superior. But 
since this method does not indicate 
whether a group solution was merely 
due to the best member, Marquart 
combined individuals into “groups” 
of three. By this method, groups 
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working as groups were no better 
than groups working as individuals. 
Marquart also used her method to 
reanalyze Shaw’s data, and found 
little difference between Shaw's 
groups and individuals. 

The Ss solved crossword puzzles 
in Comrey and Staats’ (1955) study, 
first solving individually, then in 
pairs where one S had the vertical 
code, the other the horizontal code. 
It was shown that 82% of the vari- 
ance on the group task could be pre- 
dicted from a linear combination of 
perfectly reliable high and low indi- 
vidual scores. 

The results of the preceding group 
of experiments can be fairly easily 
summarized. On “over-all” types of 
measures, groups have been superior 
to individuals on a few problems, but 
not on most problems. But where 
efficiency measures were reported, 
and also probably where they were 
not reported, individuals were supe- 
rior. 

Although theories of problem solv- 
ing will be taken up later, Lorge and 
Solomon’s (1955) paper is of interest 
here since it deals with two models of 
group problem solving. Working 
with Shaw’s (1932) data, Lorge and 
Solomon noted that some groups 
solved all the problems, some solved 
none. This suggested that group 
superiority was due to the abilities of 
members of the group, rather than to 
interpersonal interaction. So two 
ability models were proposed: (A) 
group superiority is a function only of 
the ability of one or more of its mem- 
bers to solve the problem without re- 
gard to acceptance or rejection of 
members’ suggestions, (B) group 
superiority is a function only of the 
pooled abilities of its members. 
Pooled abilities can produce solutions 
even though no member of the 


group can solve alone. Model B im- 
plies that any problem may be com- 
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posed of, and solved in, two or more 
stages, so it reduces to Model A for 
one-stage problems. Model A was 
found to be tenable for two of Shaw’s 
three problems. It was also shown 
that Model A can be modified for 
stage-wise solutions. From applying 
Model B, the authors concluded that 
Shaw’s data suggest, not personal 
interaction, but pooled abilities in 
two-stage problems, i.e., Model B. 
Perhaps Marquart’s (1955) meth- 
od of combining individuals into 
“groups” to compare with actual 
groups, and her finding that these 
two types of groups did not differ, is 
a statistical pooling of abilities which 
produces solutions even without face- 
to-face contact. 


PROBLEM SOLVING PROCESSES 


There seems to be more concern 
with behavioral processes, as presum- 
ably different from products, in the 
field of thinking and problem solving 
than in any other area of learning or 
performance. As compared to the 
literature before 1946, recent investi- 
gators tend more and more to report 
only products, e.g., so many Ss solved 
the problem, so many did not. Even 
so, perhaps half of all papers cited 
in this review have had something or 
other to say about processes. Obvi- 
ously, only major studies of processes 
can be summarized here, and these 
only very briefly. Studies of, or dis- 
cussions about, problem solving proc- 
esses are often long and extremely de- 
tailed. 

“‘Processes’”’ can mean almost any- 
thing: insight vs. trial and error, re- 
sponse variability, flexibility vs. rigid- 
ity, methods of attack, basic proc- 
esses such as perception, memory, 
intelligence, learning, etc. Other so- 
called processes are sometimes named 
and described in terms of the charac- 
teristics of the particular problems 
used in a study. This diversity pre- 
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cludes any close comparison of the 
results of different studies. Further- 
more, processes are sometimes stud- 
ied merely by giving a single group 
of Ss a problem and describing the 
Ss’ behavior in verbal or frequency 
distribution form; there may »e little 
attempt to quantify processes or to 
vary conditions. Some of the distinc- 
tion between process and product 
would disappear if more efforts were 
made to determine functional rela- 
tionships between dimensionalized 
processes and systematically varied 
conditions. 

Bloom and Broder’s (1950) reme- 
dial work with failing college students 
was based on observations of success- 
ful and unsuccessful problem solvers, 
i.e., students who did well or poorly 
on problem solving types of exam- 
inations. Detailed descriptions of 
differences in problem solving behav- 
ior, and in personality, between good 
and poor solvers were reported. The 
Ss’ responses were classified under: 
understanding the nature of the prob- 
lem, understanding the ideas con- 
tained in the problem, general ap- 
proach to the solution of problems, 
and attitude toward problem solving. 
All of these classes revealed differ- 
ences between good and poor solvers. 
The authors also noted that good and 
poor solvers differed not so much in 
having relevant information, but in 
applying it to a problem. McNemar 
(1955) reported a somewhat similar 
finding. Bloom and Broder claimed 
that problems have a figure-ground 
organization in that some elements 
of a problem stand out much more 
than others, and that some elements, 
not necessarily figural ones, furnish 
starting points much more than 
others. It seems likely that such fig- 
ural elements of all kinds are prime 
inducers of sets. 

In Buswell’s (1956) very extensive 
study, over 500 Ss were observed 
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while working on various mathemati- 
cal problems, while attempting to dis- 
cover and transfer generalizations 
(this portion of the study was cited 
earlier), and while selecting from 
cards the steps and methods they 
wanted to use in solving a problem. 
Buswell reported great individual dif- 
ferences, and trial and error rather 
than systematic approaches. In no 
case were as many as 20% of the 
group represented by any one pat- 
tern of thinking; the evidence gave 
no support to any netion that prob- 
lem solving must foilow precise rec- 
ipes. 

Earlier it was mentioned that John 
(1957) found rather consistent dif- 
ferences between those trained in nat- 
ural sciences and those trained in 
other disciplines on his PSI problem. 
Actually, John tested six groups, 
varying in kind and amount of educa- 
tional background, on two levels of 
difficulty of the problem. Eight work 
variables, four information variables, 


and nine approach variables were 


studied and _ intercorreiated, and 
changes in these patterns of behavior 
from the simpler to the more difficult 
problem were reported. These data 
cannot be summarized here, nor can 
John’s over-all description of the 
problem solving process. Some of the 
points that were emphasized were 
that past training and experience 
brought about habituation of an in- 
dividual to certain kinds of concep- 
tual and organizational processes 
which were consistently displayed, 
that some aspects of personality were 
reflected in the problem solving proc- 
ess, and that present level (not type) 
of academic training did not appear 
to change parameters of effectiveness 
on the problems to any great extent. 

Goldner (1957) studied ‘whole- 
part approach” and “‘flexibility-rigid- 
ity’’ on six problems. Although there 
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were the usual individual differences, 
intra-individual consistency was 
fairly high from problem to problem 
on the whole-part variable. Flexibil- 
ity-rigidity was also fairly consistent 
on similar tasks, but not on tasks 
that differed in structure. The two 
dimensions were separate processes 
in less structured problems, but were 
closely related in more structured 
problems. 

Practice effects, which seem ubiq- 
uitous in other areas of performance, 
have not always been found in prob- 
lem solving. One example (there area 
few others) occurred in Bendig’s 
(1953, 1957) work on patterns of be- 
havior in solving twenty questions 
problems. Bendig’s interest was in 
the information transmitted by ques- 
tions and used by Ss. Although there 
were changes over probiems in some 
of the information measures, and 
other significant effects, there was no 
learning, at least by Bendig’s method 
of measurement, over problems in 
either study. These results concern- 
ing practice effects conflict to some 
extent with those of Taylor and Faust 
(1952), but Bendig did not use the 
twenty questions game in the usual 
way, and Taylor and Faust’s work 
was conducted over a much longer 
series of problems. 

Two other major studies of prob- 
lem solving processes are those by 
Moraes (1954), and Siillwold (1954). 
Part of Moraes’ study was cited ear- 
lier in other connections; the major 
part of the work is the detailed proto- 
cols of thinking processes obtained by 
comparing children who were good 
vs. those who were poor at arith- 
metic reasoning. Siillwold claimed 
that problem solution has two phases, 
one of sudden insight, the other where 
progress is slow, and that individual 
differences in exhibiting these phases 
were consistent from problem to 
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problem. However, van de Geer 
rather thoroughly disputes Siillwold’s 
claims. 

The preceding studies do not ex- 
haust all that has been said in the re- 
cent literature on problem solving 
processes (see Chown). Most of the 
theoretical articles to be reviewed 
later, as well as all of the books and a 
good many of the empirical investiga- 
tions that were cited in other connec- 
tions have included something about 
processes. Among empirical studies 
extensive data or discussion relevant 
to processes appear in Ling (1946), 
Székely (1947), and Weaver and 
Madden (1949). 

In the reviewer’s opinion, it would 
be preferable to devote more effort to 
determination of functional relation- 
ships between environmental cr task 
variables and performence or prod- 
uct, rather than to problem solving 
processes. In oversimplified terms, 
determination of what the simple 


laws are must precede attempts to 
determine why and how the laws op- 


erate. At the same time, resec.«h on 
processes would make a greater con- 
tribution if efforts were made to de- 
velop some sort of rough classification 
of behavior patterns on which in- 
vestigators could agree and which 
would be used in more than one 
study. Possible starting points would 
be Bloom and Broder’s (1950) check- 
list, Guilford’s (1956) factors, etc. At 
present, the chief weakness of re- 
search on behavior patterns in prob- 
lem solving is that the research area 
itself is so unpatterned. 


THEORY 
It is encouraging to find that in an 
area as unintegrated as is research on 
problem solving, there are a number 
of good theoretical beginnings. The 
most thoroughgoing attempt in the 
recent literature to develop and test 
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a theory of problem solving is that by 
Maltzman and his associates. In the 
major theoretical paper (Maltzman, 
1955), the idea of the habit family 
hierarchy, derived primarily from 
Hull, is used. The divergent, trial 
and error mechanism (one stimulus 
leading to a hierarchy of responses 
in which the correct response has low 
initial strength), and the convergent, 
discrimination learning mechanism 
(one response is led to by a hierarchy 
of stimuli in which the correct stim- 
ulus is initially low in the hierarchy), 
are combined to assume a compound, 
temporal hierarchy. Reinforcement 
or extinction of individual members 
of a hierarchy are assumed to gen- 
eralize to other members. Changes in 
order of dominance in a hierarchy, or 
among hierarchies, may be produced 
by extinction of dominant incorrect 
responses or response families, by in- 
creasing the reaction potential of the 
correct response through mediated 
generalization ‘rom other reinforced 
members, or by elicitation of frac- 
tional anticipatory goal responses. 
Concerning extinction of dominant 
incorrect responses, Maltzman 
pointed out that spontaneous recov- 
ery may occur; thus, interfering re- 
sponses may recur repeatedly while S 
is working on a problem. (An ex- 
ample of this apparently occurred in 
the problem used by Kay, 1954.) Me- 
diated generalization, a basic notion 
in the theory, is said to be accom- 
plished primarily by linguistic re- 
sponses. Fractional anticipatory goal 
responses are used to interpret set; 
they are responses that are evoked by 
instructions, hints, etc. This is a val- 
uable suggestion; although sets of all 
kinds play an important role in many 
types of performance, they have re- 
ceived little attention from learning 
theorists. 

Failure to solve a problem, ina- 
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bility to overcome wrong set, and 
similar phenomena, can be accounted 
for by Maltzman’s theory. He points 
out that if the correct response is low 
in the hierarchy, generalized inhibi- 
tion from repeated unsuccessful oc- 
currences of the dominant incorrect 
response may reduce reaction poten- 
tial of the correct response below the 
threshold. Also, high irrelevant 
drive, such as anxiety, will not only 
produce competing responses, but by 
increasing the total drive will multi- 
ply by all habit strengths, thereby in- 
creasing the advantage of a dominant 
incorrect response over a weaker cor- 
rect response. As noted earlier, the 
prediction concerning irrelevant 
drive has received some confirmation 
in simple set problems (Chown, 1959; 
Mayzner & Tresselt, 1956). (Predic- 
tion of the effect of irrelevant drive 
on other problems is difficult; a dom- 
inant incorrect set should be in- 
creased in strength, but the simul- 
taneous occurrence of competing re- 


sponses might facilitate solution by 


increasing response’ variability.) 
Other predictions from Maltzman’s 
theory, and some expansions of the 
theory, appear in his several empiri- 
cal studies, cited earlier. The theory 
does not come to close grips with such 
tasks as the two-string problem, but 
it is still one of the most fruitful the- 
ories yet offered in problem solving. 

Borrowing from Maltzman and 
others, Cofer and his associates 
(Cofer, 1957; Judson & Cofer, 1956; 
Judson, Cofer, & Gelfand, 1956) em- 
phasize particularly the role of verbal 
responses as mediators in response 
hierarchies. The several experiments 
(reported in the two 1956 papers and 
cited earlier) deal either with varia- 
bles that produce changes of dom- 
inance in verbal response hierarchies, 
or with the effects of such changes on 
problem solution. The former type of 
experiment was quite successful; the 
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latter was fairly successful. Although 
Cofer deals mostly with hierarchies 
among verbal responses, as does 
Maltzman, there are also, of course, 
hierarchies among instrumental re- 
sponses. Staats (1957), borrowing 
from Osgood (1953), developed his 
experiment on the basis of possible 
relationships between verbal and in- 
strumental hierarchies associated 
with the same stimulus object. 

A phenomenological theory of 
problem solving has been presented 
in some detail by van de Geer (1957). 
Briefly, different aspects of the same 
object may appear in perception; 
therefore, situations vary in degree 
of “transparency.” In thinking, 
other aspects of the situation must 
be explicated, thereby reducing the 
nontransparency of the situation. In 
connection with this theory, van de 
Geer makes a worthwhile effort to 
classify problems. His major cate- 
gories are three “points of view” 
toward problems: in what way does S 
try to solve, what is the nature of the 
difficulty of the problem, what is the 
nature of the initial and the goal 
situations. Each point of view pro- 
vides, to some extent, a classification 
of problems. For example, the dis- 
tinction sometimes made between in- 
sight problems and trial and error 
problems appears, in other terms, 
under “nature of the difficulty.” 

Saying that a phenomenological 
theory does not lead directly to a pro- 
gram for experimental research, van 
de Geer goes on to present an axio- 
matic approach to problem solving, 
based on game theory and informa- 
tion theory. He shows how the model 
handles each of the types of problem 
listed under his ‘“‘nature of the diffi- 
culty” category. In this connection 
van de Geer claims that S’s intelli- 
gence and “thinking-out capacity” 
determine how difficult S will find a 
problem to be, and therefore whether 
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S will show insight or trial and error. 
This attempt to reduce insight and 
trial and error to a single underlying 
principle has some similarity to 
Galanter and Gerstenhaber’s (1956) 
relating of the two patterns of behav- 
ior. 

Van de Geer’s point that phenom- 
enological theory does not easily gen- 
erate experimentally testable hy- 
potheses alse applies to some other 
““theories,’’ descriptions of processes, 
lists of steos toward solution, etc., 
which have been offered in the area 
of thinking and problem solving. In 
sharp contrast, —Jnderwood (1952) 
presents a combinz tion of theory and 
orientation toward research that di- 
rectly suggests manipulatable varia- 
bles. To begin with, thinking, includ- 
ing both concept formation and prob- 
lem solving, is said to be the learning 
or the recognizing (discovering?) of 
perceptual or functional relationships 
among stimuli. Stimuli may include 


objects, symbols, or other relation- 


ships (as in ‘syllogisms). The basic 
assumption is that for the perception 
of relationships among stimuli to oc- 
cur, the appropriate responses to 
those stimuli must be contiguous. 
The reviewer would interpret this 
assumption to mean that when pre- 
sented separately, S; and S, lead, or 
can be made to lead, to the same R;. 
Since both stimuli lead to the same 
response, there is a relationship be- 
tween them which, however, will not 
necessarily be perceived unless they 
are presented in such a way that 
“both” Ris occur contiguously. A 
mediational mechanism could also be 
included: the first R; produces stim- 
uli, traces of which overlap with oc- 
currence of the second R;. 

Whether or not this is a correct in- 
terpretation of Underwood's basic 
assumption, it is clear that manipu- 
latable variables in thinking are those 
factors that increase or decrease re- 
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sponse contiguity. Underwood men- 
tions such factors as mode of presen- 
tation of stimuli, number and sim- 
ilarity of stimuli, several kinds of 
biases, and memory. He points out 
the importance of response hier- 
archies, and indicates that the theory 
leads to a number of predictions. One 
of the predictions is that massed 
practice, by preventing forgetting, 
may be better than distributed prac- 
tice in thinking. The literature bear- 
ing on this point is conflicting. Al- 
though Underwood's theory is not as 
easily applied to some types of com- 
plex problems as it is to other types 
or to concept formation, it is more di- 
rectly tied to a basic process (contig- 
uity) than is any other theory, and is 
on. of the best single suurces for re- 
search hypotheses. 

Except for van de Geer’s phenom- 
enological theory, all of the theories 
discussed so far are S-R behavioristic 
types. Unfortunately, much less has 
been done to expand Gestalt theory. 
Distinctions between productive and 
reproductive thinking, discussions of 
the role of insight, and experiments 
on functional fixedness, explication 
of the goal, and water jars set, all 
stem more or less from Gestalt ori- 
gins. Humphrey (1951), and van de 
Geer (1957) indicate strengths and 
weaknesses of Gestalt theory, and 
Saugstad (1957) rejects it. But only 
Helson and Helson (1946) have made 
a serious attempt to generalize the 
theory to new situations. Their ap- 
proach is to show that configura- 
tional principles also apply to ab- 
stract, symbolic problems, as well as 
to Wertheimer’s (1945) geometric 
tasks. They go through the steps 
needed to solve a mathematical prob- 
lem by analysis of the whole (equa- 
tion) into natural parts related to it, 
rather than by trial and error or by 
use of high-level mathematical 
knowledge. It is shown that reorient- 
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ing the equation by use of new sym- 
bols aids solution, and this reorient- 
ing is said to be the same process that 
operates in geometrical problems or 
in perceiving hidden figures. (Others 
have directly attempted to relate re- 
organization in certain problems to 
scores on hidden figures tests: see 
Chown, 1959; Frick & Guilford, 
1957.) Helson and Helson consider 
that substitution of symbols or of new 
symbols is the distinguishing mark of 
abstract thinking, and that it is fre- 
quently desirable to replace concrete 
features with symbols since symbols 
are easier to manipulate and also tend 
to suggest new combinations. This 
point may have some relation to the 
“concreteness” studies reported ear- 
lier; if Ss do tend to replace concrete 
features with symbols, the failure of 
most of the concreteness studies to 
find any difference among various 
perceptual or symbolic modes of 
problem presentation would be un- 
derstandable. Indeed, Helson and 
Helson conclude that no sharp line 
can be drawn between concrete and 
symbolic procedures; most individ- 
uals use both in actual thinking. 

Gestalt theory, with its emphasis 
on reorientation within a problem, 
also bears some relationships to stud- 
ies of ‘‘variation among problem ele- 
ments,”” and to studies concerned 
with ‘‘methods of understanding.” It 
is possible that Ss could be trained in 
reorienting as a method of under- 
standing, and that such a skill would 
transfer to a wide variety of prob- 
lems. 

Other theoretical contributions 
may be mentioned briefly. Flavell 
and Draguns (1957) hold that both 
thought and perception undergo a 
very brief but important microde- 
velopment. Their suggestions deal 
with a matter to which too little at- 
tention has been paid, viz., the sets 
that are instantaneously induced 
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upon initial perception of a problem. 
Stolurow, Bergum, Hodgson, and 
Silva (1955) present a probabilistic 
model of trouble shooting. The prob- 
ability that each of several defects 
may be causing malfunction in air- 
plane engines and the time to repair 
each defect are combined in a ratio 
to indicate which order of checking 
defects one should follow for most 
efficient repair. It seems likely that 
the same sort of model could be 
worked out for other complex appa- 
ratus problems such as Fattu, Mech, 
and Kapos’ (1954) gear train. Hum- 
phrey (1951), Johnson (1955), and 
Weaver and Madden (1949) all make 
several points relevant to the de- 
velopment of problem solving theory. 
Mayzner (1955) first develops pre- 
dictions from theories of Hull, Wer- 
ner, and an earlier theory of Under- 
wood, then shows how each theory 
fared in comparison to his data. 

Several of the prototheories re- 
viewed here seem promising. How- 
ever, they have not yet been directly 
followed up by much experimenta- 
tion, and those who do experimental 
work have made little effort to relate 
their results to what theory is avail- 
able. This lack of rapprochement be- 
tween existing theory and existing 
data is another one of the reasons 
why the area of problem solving 
shows lack of integration. 

Some further points may be made 
with regard to theory in problem 
solving. First, problem solving in 
human adults is to a considerable ex- 
tent a matter of transferring past- 
learned skills and responses to the 
immediate problem situation. In one 
way or another this fairly obvious 
point has been implied by many in- 
vestigators, even those who hold toa 
distinction between productive and 
reproductive thinking. Yet relatively 
little use has been made of existing 
transfer theory or data. For example, 
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many problems can be interpreted as 
negative transfer situations.? Some 
of the variables of which negative 
transfer is a function are known from 
studies of other types of human learn- 
ing and performance, both verbal and 
motor. These studies provide many 
suggestions for research, and to some 
extent for theory, in problem solving. 
Negative transfer is merely an ex- 
ample. A thoroughgoing transfer ap- 
proach to problem solving could also 
make use of much that is known 
about positive transfer, including the 
possibility of “learning to think” 
(Harlow, 1949; Underwood, 1952; 
Weaver & Madden, 1949). 

Second, except for Underwood's 
(1952) paper, and perhaps a few sug- 
gestions by Bruner, Goodnow, and 
Austin (1956), almost nothing has 
been done to relate, theoretically or 
experimentally, the area of problem 
solving to the large literature on con- 
cept formation. Yet the initial dis- 
covery of relevant among irrelevant 
dimensions in concept formation is 
probably not basically different from 
discovery of the correct solution in 
problem solving. Both Riley (1952) 
in problem solving, and Richardson 
and Bergum (1954) in concept forma- 
tion, have recognized separate dis- 
covery and fixation phases in per- 
formance. The discrete S-R problems 
used by several investigators (e.g., 
Brush, 1956; French, 1954; Marx et 
al., 1956; Noble, 1955; Ray, 1957; 
Riley, 1952) can probably be modi- 
fied to vary continuously from “‘pure”’ 
concept formation to “‘pure”’ problem 
solving tasks. 

Finally, despite what has been said 
above concerning theory, the review- 
er’s position is the same as that of 
Ray (1955) and Underwood (1952). 


? Several illustrations of this statement were 
pointed out to the reviewer by Rudolph W. 
Schultz, whose suggestions were of cansider- 
able help. 
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These authors emphasize that al- 
though theoretical developments are 
not necessarily unwelcome, the basic 
need in problem solving is experi- 
mental determination of the func- 
tional relationships between dimen- 
sionalized independent variables and 
problem solving performance. 
CONCLUSIONS 

The following conclusions are sug- 
gested. Problem solving in human 
adults is a name for a diverse class of 
performances which differs, if it dif- 
fers at all, only in degree from other 
classes of learning and performance, 
the degree of difference depending 
upon the extent to which problem 
solving demands location or integra- 
tion of previously learned responses. 
Problem solving performance varied 
most clearly as a function of simple 
sets, of a few kinds of complex sets, of 
changes in the relationships among 
elements of a problem, of level of 
problem difficulty, of aids toward so- 
lution, and of certain characteristics 
of the subject, especially sex, age, and 
reasoning ability. The variables that 
influence simple sets were largely 
those that affect performance, and 
that affect performance in similar 
ways, in other situations. Individual 
differences in problem solving profi- 
ciency appeared to be relatively 
stable. 

Problem solving was_ usually, 
though not always, unaffected by 
differences in the degree of concrete- 
ness or abstractness of versions of the 
same problem. Other variables and 
conditions either yielded conflicting 
results, or more commonly, were em- 
ployed in too few studies to warrant 
a conclusion. 

Groups produced more or better 
solutions to some problems than did 
individuals; on most probiems there 
was no difference. Individuals were 
superior to groups on measures of 
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efficiency. Research on problem solv- 
ing processes revealed very diverse 
patterns of behavior. Problem solv- 
ing theories that show some promise 
are beginning to be developed. 

The field of problem solving is 
poorly integrated. The reasons for 
this seem to be the use of a great vari- 
ety of tasks to provide problems, the 
frequent use of unanalyzed and non- 
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dimensionalized variables, the lack 
of an agreed-upon taxonomy of be- 
havioral processes, and to some ex- 
tent the failure to relate data to 
other data or to theory. Problem 
solving particularly needs research to 
determine the simple laws between 
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RELIEF OF CHRONIC PAIN BY PREFRONTAL LEUCOTOMY, 
OPIATES, PLACEBOS, AND HYPNOSIS 
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The response to a nociceptive stim- 
ulus normally includes at least four 
components: “the sensation of pain”’; 
discomfort; withdrawal movements; 
and some measurable physiological 
alteration, e.g., a transient or pro- 
longed increase or decrease in blood 
pressure (Nafe & Wagoner, 1938; 
Goetzl, Bien, & Lu, 1951). This 
paper is concerned with the neuro- 
logical correlates of this total re- 
sponse—hereafter termed the pain re- 
sponse—and how this total response 
or some components of this response 
can be mitigated or eliminated by 
prefrontal leucotomy, opiates, place- 
bos, and hypnosis. 


THE NEUROPHYSIOLOGICAL Cor- 
RELATES OF THE PAIN RESPONSE 
Free Nerve Endings: The So-Called 

“Pain Receptors” 


It has generally been assumed that 
the free nerve endings, which are 
found widely scattered near the cu- 
taneous and visceral surfaces, are the 
Specific receptors for noxious stimuli. 
However, Sinclair, Weddell, and 
Zander (1952) have shown that Ss 
can discriminate cold, heat, touch, 
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and prick just as well from the ear 
pinna, which contains only bare 
nerve endings and a basketlike net- 
work around the hair follicles, as they 
can from the skin of the forearm, 
which contains all of the encapsu- 
lated endings which have been de- 
scribed. Lele, Weddell, and Williams 
(1954) have demonstrated that the 
free nerve endings in the skin, when 
suitably stimulated, “give rise to a 
wide range of sensory experience 
which includes reports of ‘cold,’ 
‘touch,’ ‘warm,’ ‘prick,’ ‘itch,’ and 
‘sharp pain.’” Lele and Weddell 
(1956) have confirmed earlier findings 
(e.g., Nafe & Wagoner, 1936) that Ss 
report not only pain but also touch, 
warmth, and cold when appropriate 
stimuli are applied to the center of 
the cornea which contains only free 
endings. These and other investiga- 
tions recently reviewed “y Weddell 
(1955) and Sinclair (1955) indicate 
that a wide variety of sensory ex, eri- 
ences can be evoked by suitable stim- 
ulation of the free nerve terminals 
and that theories of cutaneous sensi- 
bility postulating specific receptors 
for each sense modality are open to 
serious objection at the present time.* 


Peripheral Conduction 


A series of earlier investigations, 
reviewed by Bonica (1953, pp. 29- 


* The possibility remains that the term free 
nerve endings does not refer to homogeneous 
units. If future investigations demonstrate 
specific biochemical differences between these 
endings, the question of specific receptors for 
the various sense modalities may require fur- 
ther investigations at the molecular level. 
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30), had apparently shown (a) that 
asphyxia or pressure applied to a per- 
ipheral nerve blocked the large, my- 
elinated, fast-conducting A _ fibers 
first and abolished touch before pain 
and (b) that cocaine blocked the 
small, unmyelinated, slow-conduct- 
ing C fibers first and abolished pain 
before touch. This appeared to be 
satisfactory evidence that touch is 
correlated with conduction in the 
larger A fibers and that pain is cor- 
related with conduction in the small 
C fibers.? Other data, however, indi- 
cate that noxious stimuli applied to 
the cutaneous surface activate many, 
if not all, of the fiber types present 
in the cutaneous nerves, viz., the 
smaller A fibers and the C fibers. 
Since the large A fibers are present 
only in the muscle branches of the 
nerves and the Group B fibers consist 
of sympathetic preganglionic axones 
(Lloyd, 1955; Ruch, 1955, p. 334), 


* At the present time, extreme caution is 
necessary in drawing conclusions from the 
earlier experiments on reversible nerve blocks 
produced by asphyxia, compression, cocaine, 
etc. (Jones: 1956, 1958; Schiller, 1956). Ina 
series of carefully controlled studies of nerve 
blocks produced by procaine, compression, 
and cooling, Sinclair and Hinshaw (Sinclair, 
1955) have demonstrated that it is possible 
to obtain almost any order of sensory loss by 
using different Ss, by varying the site stimu- 
lated, and by altering the nature of the stimu- 
lus. 

The question of “‘double pain” and its re- 
lation to conduction in A-delta and C fibers 
has also been opened for further inquiry. 
“Double pain"’ may be due to inadequate con- 
trol of the stimulus at the receptor level: 
Jones (1956) demonstrated that (physio- 
logically normal) Ss do not report double pain 
if the stimulus ts prevented from stimulating the 
same receptors more than once. Sinclair (1955, 
p. 594) has also concluded from his own work 
and from earlier investigations in this area 
that “the question of second pain cannot be 
regarded as settled and the idea of two sets of 
pain fibres rests upon work which is not im- 
mune to criticism of the experimental findings 
as well as the interpretations placed upon 
them.” 


431 


they cannot be directly activated by 
cutaneous | stimuli. Heinbecker, 
Bishop, and O’Leary (1933) demon- 
strated with a human subject that 
electric shock applied to an exposed 
nerve in such a manner as to stimu- 
late the A-delta fibers consistently 
evoked a pain response. Zotterman 
(1939) reported that a burning stim- 
ulus applied to the skin of cat evoked 
a spike composition that included 
both the A-delta and C fibers. Brook- 
hart, Livingston, and Haugen (1953) 
demonstrated that stimulation of the 
tooth pulp (which normally evokes a 
pain response) yields conduction 
characteristics of the A gamma-delta 
fibers. After reviewing these and 
other investigations attempting to 
relate the modalities of sensation to 
conduction in specific fibers, Living- 
ston (1943), Bonica (1953), Sinclair 
(1955), and Schiller (1956) agree with 
Gasser (1943, p. 59) that “the fibers 
belonging to different modalities 
must be widely distributed through- 
out the various fiber sizes, and that 
there seems to be little possibility of 
associating any one sensation with an 
elevation in the electroneurogram.” 

If all cutaneous stimuli activate 
“fibers widely distributed throughout 
the various fiber sizes,’’ what deter- 
mines the differential response to 
each stimulus? To account for this 
differential response, investigators in 
this area (Bishop, 1946; Weddell, 
1955; Sinclair, 1955) hypothesize 
that each cutaneous stimulus sets off 
a pattern of nerve impulses which 
differs from the pattern set off by 
other stimuli in that the relative num- 
ber of activated fibers of various sizes 
differ, and the relatively different 
sizes carry impulses differing in en- 
ergy value, frequency, and duration. 
A light touch, for example, preferen- 
tially activates the larger fibers. 
However, this does mot mean that 
these large fibers are specific to light 
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touch stimuli. The larger the fiber 
the lower its threshold. The slight 
disturbance caused by light touch, 
therefore, activates the largest fibers 
with the lowest threshold more readily 
than the smaller fibers with higher 
thresholds. Similarly, nociceptive 
stimuli applied to the cutaneous sur- 
face may more readily activate fibers 
in the C range, but this does not 
mean that these stimuli do not also 
activate other cutaneous fibers and 
it also does not mean that other 
stimuli cannot activate the C fibers. 


Conduction in the Spinal Cord 


Peripheral nerve fibers, which are 
both myelinated and unmyelinated 
and which may belong to either Class 
A or C, travei along the cranial, 
spinal, or sympathetic nerves to 
posterior root ganglia where they 
synapse with second order neurons. 
The generally accepted view is that 
noxious stimuli applied to the viscera 
and to subcutaneous and cutaneous 
structures activate those fibers in the 
spinal cord which cross through the 
anterior white commissure to the op- 
posite lateral funiculus where they 
ascend cephalad, forming the lateral 
spinothalamic tract. However, this 
is by no means the complete story. 
True enough, in many cases, cutting 
the lateral spinothalamic tract pre- 
vents a response to many nociceptive 
stimuli applied to the contralateral 
side. However, in some cases, this 
loss is temporary; normal pain re- 
sponsiveness may return after an in- 
tervening period (Ranson, 1943, p. 
111). Also, this operation (antero- 
lateral cordotomy) does not abolish 
pain discrimination while leaving 
temperature, touch, and pressure dis- 
crimination intact. As Schiller (1956, 
p. 208) points out, “One modality or 
two are never either completely 
spared or abolished to the absolute 
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exclusion of the others . . . parts de 
nervated by anterolateral cordot- 
omy are reported as feeling ‘numb, 
discrimination of two points and tex- 
ture of materials is diminished, and, 
in addition, there are thermanes- 
thesia and analgesia.’’ In addition, 
White and Sweet (1955, p. 45) dem- 
onstrated that a current at 100 or 
more volts applied to the “analgesic 
side"’ invariably produced a report of 
severe pain in all (40) patients ex- 
amined. King (1957) confirmed this 
finding and, in addition, found that 
(after anterolateral cordotomy) the 
maximum elevation of the pain 
threshold on the “‘analgesic side’’ did 
not exceed 40 to 50°%. Since, in the 
great majority of cases, the pain 
threshold elevation was much less 
than this maximum, and, in some 
cases, was not significantly different 
from the pain threshold on the nor- 
mal side, King concluded that “a 
polysynaptic relay pathway for pain- 
ful stimuli in man, aside from the 
spinothalamic system, seems prob- 
able.” 

Not only does anterolateral cord- 
otomy consistently fail to abolish the 
response to more intense noxious 
stimuli, it also fails to affect the pain 
response to pinpricks in the majority 
of cases. White and Sweet (1955, p. 
262) found that, after this operation, 
60°; of their patients consistently re- 
ported pain when multiple rapid pin- 
pricks were applied to the “analgesic 
side.” 

French and Peyton (1948), Voris 
(1951), and White and Sweet (1955, 
p. 45) have presented additional evi- 
dence indicating that nociceptive 
stimuli activate fibers that do not 
cross to the opposite side in the spinal 
cord. Each of these investigators has 
reported cases in which the “anal- 
gesia’”’ was ipsilateral following anter- 
olateral cordotomy. From these re- 
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ports White and Sweet (1955, p. 275) 
conclude that the “pain fibres are 
diffused over a very wide area of the 
anterolateral quadrant, and that at 
times some centrally conducting 
fibres must run upwards in the ipsi- 
lateral as well as in the contralateral 
columns, at least for a considerable 
distance.” 

Even the above account is incom- 
plete; nociceptive stimuli can acti- 
vate far more units in the cord than 
those found on both sides of the an- 
terolateral quadrant. Livingston 
(1943) and White and Sweet (1955) 
have found that in many cases bilat- 
eral anterior cordotomy is insufficient 
to relieve a pain svndrome and 
Lhermitte and Puech ‘1946), Pool 
(1946), and Browder and Gallagher 
(1948) have demonstrated that some 
pain syndromes can be relieved by 
posterior cordotomy. Keele (1957, p. 
164) has reviewed additiona) evi- 
dence which indicates that the ‘‘pain 
tracts’’ may be widely dispersed in 
the cord and concludes that “one is 
induced to look upon their anatomy 
as one of statistical probability, and 
to wonder how closely or how perma- 
nently the function of transmission 
of pain sense is attached to fixed 
neuronal paths in the cord.”” After 
summarizing the evidence in this 
area, Adey (1957) similarly concludes 
that the concept of localized fiber 
pathways in the spinal cord carrying 
particular types of sensory impulses 
is open to serious question and re- 
quires revision. 

A number of investigators have 
interpreted their data as indicating 
that the same stimulus may activate 
different neural units in the cord at 
at different times. Gasser (1937) 
writes that ‘‘a given stream of affer- 
ent impulses over a peripheral nerve 
follows one pathway in the centers 
at one time and another pathway at 
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another time. The direction of the 
switching is conditioned by the situa- 
tion obtaining at the moment, and is 
always consonant with a coordinated 
reaction of the whole organism.” 
Along similar lines, Livingston (1943, 
p. 25) interprets the evidence as indi- 
cating that “impulses, finding them- 
selves blocked from their customary 
pathways, eventually find new or pre- 
viously unused pathways.” Bishop 
(1944) likewise suggests that when 
impulses along a neural pathway 
reach a certain critical frequency, 
they are ‘“‘switched”’ to different con- 
duction units from those into which 
they normally pass. 

A number of other considerations 
should be emphasized. First of all, 
there is no need to hypothesize spe- 
cific pathways for pain and other 
modalities to understand the altera- 
tions in sensibility which follow an- 
terolateral cordotomy. As Sinclair 
(1955, p. 606) has pointed out, ‘In- 
stead of cutting specific fibres, we 
may be so altering the sensory pat- 
terns the spinal cord is capable of 
conducting in such a way as to lead 
to a sensory dissociation.’’ Further- 
more, even if we assume a “‘segrega- 
tion” of “‘pain-conducting fibers’ at 
the cord level, we cannot relate this 
“segregation” to the “sensation of 
pain,”’ to discomfort or suffering, or 
to other components of the pain re- 
sponse which appear to require 
higher neurological levels. Whatever 
“segregation”’ of fibers occurs at the 
cord level can be related only to re- 
flex functions at this level and to 
nothing more (Bishop, 1946). It 
should also be noted that afferent 
impulses in the cord can be altered 
by impulses descending from the 
brain stem and cerebrum. Hagbarth 
and Kerr (1954) have demonstrated 
that afferent volleys in the anterior 
columns of the spinal cord are re- 
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duced in amplitude by electrical 
stimulation of the bulbar and mid- 
brain reticular formation, of the pre- 
central and postcentral gyri, and of 
various other forebrain structures. 


Conduction at the Brain Stem Level 


There seemed to be general agree- 
ment, just a few years ago, that spi- 
nothalamic pathways “carried pain” 
without interruption through the 
medulla, pons, and midbrain to the 
posterolateral ventral nucleus of the 
thalamus. Recent evidence indicates 
that this also is an incomplete ac- 
count. First of all, there is little 
doubt that the great majority of the 
fibers from the anterolateral funicu- 
lus of the cord terminate at levels 
below the thalamus (Walker, 1940; 
Walker, 1943; Bowsher, 1957). Fur- 
thermore, an extensive series of in- 
vestigations, recently summarized by 
Magoun (1958), indicate that the 
“classical sensory pathways’ (includ- 
ing the “pain” pathways) give off 
collaterals to the reticular formation 
of the brain stem (and to the “dif- 
fusely”’ projecting thalamic nuclei) 
and that appropriate electrical stimu- 
lation of this zone of collateralization 

“the reticular activating system” 

causes a desynchronization of elec- 
trical activity throughout wide areas 
of the cortex such as is seen in the 
“‘arousal” reaction in the normal ani- 
mal. In line with this evidence, it 
has been demonstrated that anes- 
thetics exert their primary effect in 
blocking the response to noxious 
stimuli (as weil as other stimuli) by 
preventing conduction through the 
reticular area of the brain stem 
(French & King, 1955). French, 
Verzeano, and Magoun (1953) re- 
ported that both sodium pentobarbi- 
tal and ether depress conduction 
through the reticular formation while 
the direct afferent pathways con- 
tinue to conduct impulses in normal 


manner. Similarly, Arduini and 
Arduini (1953), Peterson (1955), and 
Haugen and Melzack (1957) found 
that the potentials in the reticular 
formation were much more suscepti- 
ble to procain, nitrous oxide, and 
other drugs than the potentials in 
the direct afferent pathways. 

In summary, recent evidence seems 
to be consistent with Melzack, Stot- 
ler, and Livingston's (1958) conclu- 
sion from their study of brain stem 
lesions in the cat: 

Whatever the nature of pain perception 
may be, its neural substrates appear to be 
much more complex than that envisaged in a 
single ascending system. The patterns of im- 
pulses subserving pain appear to travel over 
multiple pathways at the brainstem level at 
least, and the ultimate perceptual event seems 
to depend upon activities occurring along all 
of these pathways (p. 365). 


Conduction at the Thalamo-Cortical 
Level 

It has generally been assumed that, 
aiter synapsing at the posterior ven- 
tral nucleus of the thalamus, “pain 
fibers’’ project to the postcentral con- 
volution of the cortex. However, as 
Walker (1943) has pointed out, the 
posterior ventral nucleus has numer- 
ous connections with the adjacent 
thalamic nuclei and impulses can be 
conveyed to wide areas of the cere- 
bral cortex in this indirect fashion.‘ 
Also, as pointed out above, nocicep- 
tive stimuli applied to visceral, 
somatic, and cutaneous structures 
also activate neurons in the reticular 
formation which sends impulses to 
many cortical areas over both tha- 
lamic and extrathalamic pathways. 
In line with these considerations, 


* Murphy and Gellhorn (1945) report that 
strychninization of this thalamic nucleus also 
leads to firing in the ipsilateral and contra- 
lateral hypothalamus. Apparently, this is one 
of the pathways involved in the general 
hypothalamic excitation which follows periph- 
eral noxious stimulation (Gellhorn & Ballin, 
1946). 
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Gellhorn and Ballin (1946) report 
that noxious stimuli applied to the 
periphery of narcotized animals alter 
electrical activity throughout the en- 
tire cortex, and Benjamin and Ivy 
(1949) report that noxious stimuli 
applied to the extremities of human 
Ss evoke a nonspecific decrease in 
amplitude of the waves from the 
parietal, occipital, temporal, and 
frontal areas.® 

In general, the evidence summar- 
ized below indicates that ‘‘adequate” 
stimulation of the cerebral cortex 
may elicit reports of pain; and that 
damage to a number of cortical areas 
may affect “the sensation of pain”’ 
and the withdrawal movements 
which normally follow noxious stimu- 
lation. 

In rare instances, electrical stimu- 
lation of the cerebral cortex, espe- 
cially of the precentral, postcentral, 
and superior parietal gyri, has been 
followed by “a sensation of pain” 
localized in the face, limbs, trunk, or 
other body area (Penfield & Boldrey, 
1937; Horrax, 1946; Lewin & Phil- 
lips, 1952). Also, in a few patients, 
destruction of the postcentral, supe- 
rior parietal, superior temporal, and 
insular convolutions (Davison & 
Schick, 1935); or tumors in the pari- 
etal lobe alone or in the parietal plus 
the frontal or occipital lobe (Michel- 
son, 1943); or tumors in the right or 
left parietal, frontal, and temporal 
areas (Bender & Jaffe, 1958); have 
been reported to give rise to “‘spon- 
taneous pain” referred to various 
body areas. However, we cannot 


§ Although wide ares of the cerebral cortex 
are normally activated by peripheral nocicep- 
tive stimulation, it is quite certain that some 
of the components of the pain response can be 
carried out by animals lacking this structure. 
In pontile cats, for example, Bard and Macht 
(1958) report that a strong nociceptive stimu- 
lus elicits growl-like vocalizations, protrusion 
of claws, running movements, piloerection, 
and increased respiratory and cardiac activity. 
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conclude from these reports that the 
cerebral cortex ‘‘subserves’’ some spe- 
cial function in the “perception of 
pain.” Electrical stimulation of 
many other neural tissues also, at 
times, evokes referred “‘pain sensa- 
tions’”’ (Sweet, White, Selverstone, & 
Nilges, 1950). Also, a series of in- 
vestigations, summarized by Bonica 
(1953, p. 131), White and Sweet 
(1955, pp. 526-528), and Bender and 
Jaffe (1958), indicate that pathologi- 
cal processes in the spinal cord, the 
brain stem, and the thalamus may 
also produce “spontaneous pain”’ in- 
distinguishable from that produced 
by lesions or tumors in the cortex. 
Furthermore, the pain response which 
follows electrical stimulation or path- 
ological processes in the cortex could 
possibly be integrated by subcortical 
structures. Finally, it should be em- 
phasized that reports of pain follow- 
ing electrical stimulation or lesions 
in the cerebral cortex are so rare that 
Penfield and Rasmussen (1950, p. 3) 
conclude that “the thalamus retains 
the problem of disposing of pain im- 
pulses without calling on the cortex 
for essential help.” 

In rare instances, cortical lesions 
have been reported to prevent “‘the 
sensation of pain” or ‘‘the recogni- 
tion of the stimulus” without affect- 
ing other components of the pain re- 
sponse. Gilliatt and Pratt (1952) 
have reported tat after a “right- 
sided cerebral thrombosis” a patient 
did not “consciously recognize’”’ noxi- 
ous stimuli applied to the left side of 
the body, even though the stimuli 
gave rise to general restlessness, in- 
creased blood pressure, tachycardia, 
deepening of respiration, and dilata- 
tion of the pupils. Marshall (1951) 


has also published 11 cases of left and 
right parietal lesions which were fol- 
lowed by a deficient ‘‘pain sensation” 
when pinprick, or intravenous injec- 
tion of hypertonic sodium chloride, 
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was applied to areas contralateral to 
the lesion. These reports are excep- 
tional; localized cortical lesions are 
not usually followed by alterations in 
“the sensation of pain.” Penfield 
(cited by White & Sweet, 1955, p. 
109), after wide experience with corti- 
cal ablations, states that he has 
“never seen a patient who had a 
parietal lesion lose sensation of pain 
excepting for a few hours or days fol- 
lowing excision.”” White and Sweet 
(1955, p. 63) similarly conclude after 
reviewing the evidence that ‘“‘studies 
in man following localized cortical 
extirpations reveal little reduction of 
pain sensation upon peripheral stimu- 
lation, and confirm the huge extent 
of the cortex concerned with sensa- 
tion.” 

Damage to a number of cortical 
areas may affect the purposive with- 
drawal movements which normally 
follow nociceptive stimulation. 
Schilder and Stengel (1931) pub- 


lished a study of 3 patients with 


tumors or lesions of the left parietal 
lobe (with additional lesions, in two 
of the cases, in the frontal or tem- 
poral lobe) who did not withdraw from 
noxious stimuli, threatening gestures, 
loud noises, or sudden flashes of light. 
Similarly, Hemphill and Stengel 
(1940) reported that a patient with a 
probable lesion of the left labyrinth 
failed to show withdrawal responses 
to noxious stimuli and to unexpected 
sounds. Although the patient ‘‘ad- 
mitted that he could feel the painful 
stimulus” and that he could hear an 
automobile horn, he failed to show 
withdrawal or defense reactions when 
a match was struck close to his eyes 
and when an automobile horn threat- 
ened his life. Rubins and Friedman 
(1948) have published a similar study 
of four patients with lesions in or 
around the supramarginal gyrus of 
the dominant hemisphere who showed 
a lack of withdrawal to noxious 


X. BARBER 


stimuli and to threatening gestures 
even though they “felt” pain and 
were aware of the threatening char- 
acter of the gestures. The latter in- 
vestigators emphasize that only cer- 
tain motor withdrawal reactions ap- 
pear to be normally integrated in or 
around the damaged areas. 
Although specific unilateral lesions, 
in some instances, result in deficien- 
cies in pain responsiveness, it by no 
means follows that the more exten- 
sive the unilateral lesion, the more 
deficient the response. On the con- 
trary, removing either the right or 
left cerebral hemisphere either does 
not seriously affect the response to a 
noxious stimulus or alters only the 
response to lower intensities of stimu- 
lation. Dandy (1933) reported two 
cases of extirpation of the right cere- 
bral hemisphere; in both patients the 
response to a pinprick on the contra- 
lateral side was seriously deficient, 
but movements of joints and com- 
pression of muscles on either side of 
the body brought forth a pain reac- 
tion with an intense ‘feeling’? com- 
ponent. Gardner (1933) found that 
20 months after right hemispherec- 
tomy firm pressure with a pin (on the 
contralateral or ipsilateral side) was 
recognized as “painful.” Zollinger 
(1935) reported that after removal of 
the left cerebral hemisphere (in a 
right-handed woman), the patient 
showed “acute pain with motion of 
the joints or compression of the deep 
muscles.’”” Rowe (1937) stated that, 
after removal of the right hemi- 
sphere, his patient responded nor- 
mally to nociceptive stimuli applied 
anywhere on the ipsilateral side and 
to scattered areas on the contralat- 
eral side. Somewhat in contrast to 
the above are the later reports of 
Evans (cited by Walker, 1943), Bell 
and Karnosh (1949), Krynauw (1950) 
12 patients—, and Marshall and 
Walker (1950)—4 patients: a few 
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months after hemispherectomy, most 
of their patients showed normal pain 
responses and accurate localization of 
pinprick applied on either side of the 
body. 

From our current neurological con- 
cepts we might assume that after 
hemispherectomy the ipsilateral thal- 
amus integrates the response to a 
nociceptive stimulus on the contra- 
lateral side. This is by no means the 
case. The hemispherectomized chim- 
panzee shows practically complete 
degeneration of all the ipsilateral 
thalamic nuclei which project to the 
cortex (Walker, 1943). There is no 
reason to suspect that the same retro- 
grade thalamic degeneration does not 
occur in man. Apparently, the re- 


maining cerebral hemisphere and the 
thalamic nuclei on the same side are 
sufficient to integrate the response to 
nociceptive stimuli on either side of 
the body. 

The above studies appear to indi- 


cate the following: 


1. Noxious stimulation in the periphery 
alters electrical activity in many cortical 
areas. 

2. Referred “pain sensations” are at times 
evoked by electrical stimulation or pathologi- 
cal processes at any level of the neuraxis, 
including the cerebral cortex. 

3. In rare instances, localized cortical le- 
sions abolish ‘“‘the sensation of pain,”’ i.e., the 
ability to:discriminate a noxious stimulus and 
to differentiate it from other stimuli, without 
affecting other components of the pain re- 
sponse. Also, in rare instances, localized cor- 
tical lesions abolish the avoidance movements 
which normally follow noxious stimulation. 

4. Removal of either the right or left cer- 
ebral hemisphere either does not seriously 
affect the response to nociceptive stimuli or 
alters only the response to relatively non- 
intense stimuli applied to the contralateral 
side. 

5. Although the decorticate animal shows 
some components of the pain response—e.g., 
running movemerts and increased respiratory 
and vasomotor activity—, the tmtact organ- 
ism probably utilizes a variety of cortical 
neuronal mechanisms when carrying out the 
total response to a noxious stimulus. This is 
further exemplified in the following discussion 
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on the mitigation of the discomfort-suffering 
component of the pain response by prefrontal 
leucotomy. 


““RELIEF OF PAIN” BY PREFRONTAL 
LEUCOTOMY 


During recent years an extensive 
group of patients has undergone pre- 
frontal leucotomy (or “lobotomy’’) 
for the relief of severe, intractable 
pain syndromes such as causalgia, 
postherpetic neuralgia, metastatic 
carcinoma, thalamic syndrome, etc. 
(e.g., Van Wagenen, cited by Walker, 
1943; Dynes & Poppen, 1949; Free- 
man & Watts, 1950). Although some 
patients died soon after the operation 
and others were not “relieved of 
pain,” others were “relieved”’ (at 
least for an extended time period) 
and further analysis of this effect may 
give us an increased understanding of 
the pain phenomenon. 

First of all, it is necessary to point 
out that intractable pain has been 
alleviated in some patients not only 
by bilateral frontal leucotomy, which 
supposedly destroys the thalamo- 
frontal projections, but also by uni- 
lateral frontal leucotomy; by bilat- 
eral lower quadrant frontal leucot- 
omy; by topectomy (i.e., by remov- 
ing limited areas, such as Brodmann’s 
Areas 9 and 10, from the frontal 
lobes); and by a number of other 
operations on the frontal areas which 
have been summarized by Sargant 
and Slater (1954). Secondly, the 
“pain relief’’ which may follow these 
operations does not appear to be re- 
lated to the specific prefrontal areas 
affected; on the contrary, the degree 
of ‘‘relief”’ appears to be a nonspecific 
effect, closely related to the extent of 
the prefrontal damage (Petrie, 1951; 
Hardy, Wolff, & Goodell, 1952; 
Petrie, 1958; Elithorn, Glithero, & 
Slater, 1958). 

It must be further emphasized that 
only some patients have been helped 
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by these procedures. Walker (1950) 
estimates that at least one third of 
the patients receiving these opera- 
tions have not had any “pain relief.”’ 
A representative report is Hardy, 
Wolff, and Goodell’s (1952) analysis 
of 38 prefrontal leucotomies (25 uni- 
lateral and 13 bilateral) performed 
by Dr. Bronson Ray at the New York 
Hospital for the relief of pain syn- 
dromes related to metastatic cancer, 
Hodgkin's disease, radiculitis, tabes, 
etc. Of the 25 patients receiving uni- 
lateral leucotomy, 10 were relieved of 
pain and 15 showed no alteration in 
their pain syndrome. Of the 13 pa- 
tients receiving the bilateral opera- 
tion, 11 were relieved and 2 were not 
helped. The term relief of pain, as 
used by these investigators, implies 
that when the patient was directly 
asked if he had pain, he replied either 
that he no longer had it, or that it was 
still present but of lower intensity 
than before, or that it was still pres- 
ent but no longer “bothered” him. 
Although some investigators use an 
additional criterion—viz., that the 
patient no longer asked for drugs— 
most investigators also use the above 
criteria. 

A further point should be empha- 
sized: postmortem examinations of 
leucotomized patients indicate that 
in some cases the prefrontal areas 
are not damaged and the thalamo- 
frontal projections are mot severed. 
In a postmortem study of 15 pa- 
tients who had undergone transorbital 
lobotomy for pain of malignant dis- 
ease, Freeman and Williams (1951) 
found that 3 cases were characterized 
by massive hemorrhage, 2 cases failed 
to involve the thalamofrontal pro- 
jections, and the other cases appar- 
ently showed destruction of the thal- 
amofrontal radiations with retro- 
grade degeneration of the dorso- 
medial nucleus of the thalamus. 
Meyer and Beck (1945) also report 
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from postmortem studies that the 
prefrontal lobe is at times entirely 
untouched and that severance of the 
thalamofrontal fibers is often incom- 
plete. 

When prefrontal leucotomy allevi- 
ates intractable pain it does not nec- 
essarily elevate the pain threshold or 
alter “the sensation of pain.’’ Chap- 
man, Solomon, and Rose (1950) 
found a lowering of the pain thresh- 
old immediately after the bilateral 
operation followed by a return to pre- 
operative levels after an intervening 
time period. Hardy et al. (1952) re- 
ported that the pain threshold in 10 
postleucotomy patients, who were 
relieved of their pain syndrome, 
showed no significant difference from 
the preoperative level. King, Clau- 
sen, and Scarff (1950) noted a slight 
lowering or no change in the pain 
threshold after successful unilateral 
leucotomy for intractable pain. Also, 
with few, if any, exceptions, investi- 
gators report that the “‘sensation”’ or 
“perception” of pain is practically 
unaltered by any of these procedures: 
e.g., ‘Prefrontal lobotomy changes 
the attitude of the individual toward 
his pain, but does not alter the per- 
ception of pain” (Freeman & Watts, 
1950, p. 354). 

The evidence available at present 
also indicates that, if and when pre- 
frontal leucotomy relieves a pain 
syndrome, the relief is secondary to 
a more generalized effect of the op- 
eration which, at times, can be con- 
ceptualized as apathy, i.e., as a de- 
creased responsiveness to all stimuli 
—including nociceptive — stimuli. 
Hardy et al. (1952, p. 317) empha- 
size that postleucotomy patients who 
were either partially or totally re- 
lieved of pain “exhibited in many 
ways...a flattened affect if not 
actual apathy. ... They failed not 
only to complain of their spontane- 
ous pain but also of their needs, such 
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as personal nursing care, need of urine 
bottle, bedpan, or the adjustment of 
an uncomfortable dressing. When 
incontinent of feces they were indif- 
ferent to the odor it spread about 
their persons and beds.” Bonner, 
Cobb, Sweet, and White (1952) have 
also emphasized that apathy charac- 
terized their patients immediately 
following bilateral lower quadrant 
frontal leucotomy. Although the 
apathy tended to lessen with the pas- 
sage of time, it was still a character- 
istic feature in patients followed up 
to 36 months postoperatively. 
Although many patients who are 
relieved of intractable pain by pre- 
frontal operations do not show the 
extreme apathy described above, the 
evidence indicates that all patients 
who are helped by these operations 
show a characteristic personality al- 
teration (Krayenbiihl & Stoll, 1950; 
Petrie, 1952; Petit-Dutaillis, Mes- 
simy, & Berges, 1953; Elithorn, 
Glithero, & Slater, 1958); and that 
patients who are not helped and pa- 
tients who have undergone other op- 
erations which do xot mitigate in- 
tractable pain, e.g., temporal lobot- 
omy, cingulectomy, and orbital un- 
dercutting, do not show the same 
change in personality (Petrie, 1958). 
This characteristic alteration has re- 
ceived a wide variety of formulations: 
Dynes and Poppen (1949) concep- 
tualize it as a decrease in “worry” 
and “‘concern’’; Le Beau (1950) terms 
it the relief of ‘“‘anxiety’’; Elithorn et 
al. (1958) formulate it as an “im- 
paired ability to elaborate a persist- 
ing attitude or mood.”’ These formu- 
lations are not necessarily in basic 
disagreement; they appear to be re- 
{erring to a common behavioral ma- 
trix, viz., to a mitigated ‘‘readiness to 
respond” to external and internal 
stimuli. Summarizing the investiga- 
tions in this area, Walsh (1957, p. 
474) writes: ‘The patient suffering 
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from pain complains less of his dis- 
comfort than before. Not... a fail- 
ure to appreciate the situation but a 
failure to respond to it. . . . This fail- 
ure to react is seen when stimuli that 
arise within the body itself are con- 
sidered; but there may also be a dim- 
ished response to external situa- 
tions.” 

It should be emphasized that the 
leucotomized patient is able to re- 
spond normally to nociceptive stimu- 
lation. Hardy et al. (1952, p. 316) 
have reported that ‘“‘some patients, 
although ostensibly tranquil before 
being asked about their pain, over- 
reacted with a show of grimacing and 
fears when their attention was focused 
upon it by a direct question concern- 
ing its quality and its intensity” (em- 
phasis added). The same theme is re- 
peated by other investigators; for ex- 
ample, Hawkes and Gotten (1948, p. 
209) report that “when questioned 
[the leucotomized patients] all indi- 
cated that they realized some pain 
was present when they thought about 
it’’ (emphasis added). Apparently, 
when the leucotomized patient is di- 
rectly asked to report on his pain, he 
“focuses his attention” on and “thinks 
about” the ever-present nociceptive 
stimulus in his body and, when thus 
reacting to it, often shows discomfort 
or suffering and almost always re- 
ports a “sensation of pain.” How- 
ever, when the patient is not directly 
asked to report on the noxious tissue 
condition, he does not “‘attend”’ to it 
or “think’’ about it to the same extent 
as before the operation and, when 
not thus reacting to it, does not ap- 
pear to be “‘in pain,” i.e., does not 
show discomfort. 

Apparently, discomfort and suffer- 
ing can be minimized or eliminated 
by preventing a “secondary reaction” 
to the noxious stimulus. Neurosur- 
geons have used somewhat different 
terms to describe this effect: Freeman 
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(1949, p. 18) writes after extensive 
experience with prefrontal operations 
that “‘when the emotion is done away 
with, the pain either becomes no 
longer significant or actually disap- 
pears’’; Otenasek (1948, p. 234) sim- 
ilarly suggests that ‘‘when the fear 
of pain is abolished, the perception of 
pain is not intolerable.” 


Neurophysiological Correlates of Post- 
leucotomy ‘‘Pain Relief” 


Since prefrontal leucotomy miti- 
gates the discomfort-suffering com- 
ponent of the pain response in some 
patients and fails to do so in others, 
since this effect is often temporary, 
and since we are rarely certain in any 
one such‘operation which fiber tracts 
were destroyed, to what extent scar 
formation and vascular damage oc- 
curred, and to what extent the opera- 
tion resulted in physiological dis- 
turbances in other cerebral areas, it is 
extremely difficult to formulate any 
hypothesis concerning the “pain re- 
lief’’ which may follow this operation. 
Also, as Koskoff, Dennis, Lazovik, 
and Wheeler (1948) have pointed out: 

The relief of suffering in such a wide variety 
of patients suggests that the mechanism does 
not involve the interruption of specific pain 
pathways, despite the evidence of thalamic 
degeneration following frontal lobotomy. 
Preservation of the response to painful stimu- 
lation noted in such patients also suggests 
that the interruption of specific afferent pain 
tracts is not responsible for the relief of suffer- 
ing (p. 740). 

Nevertheless, a number of investi- 
gators have proposed a variety of 
mechanisms which may be directly 
or indirectly related to postleuco- 
tomy “‘pain relief.’’ Starzl and Whit- 
lock (1952) have presented evidence 
that the “diffuse” thalamic projec- 
tion system, which exerts a general 
cortical “arousal” effect, projects 
primarily, but not exclusively, to the 
frontal! cortex in ~onkey. From this 
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evidence they hypothesize that “‘pain 
relief”’ following leucotomy is due to 
the destruction of the afferent fibers 
from this system. Fulton (1951, p. 
127) has suggested that frontal leu- 
cotomy relieves pain by removing 
“large numbers of visceral pain pro- 
jections from the sphere of conscious- 
specifically, by destroying 
visceral afferent pathways to the 
orbitofrontal cortex. However, as 
White and Sweet (1955, p. 64) point 
out, ‘‘We know of no evidence... 
that stimulus to the central end of 
any visceral nerve carrying many 
nerve fibers, such as the great 
splanchnic nerve, will cause synchro- 
nous bursts of change of potential 
within this part of the brain in mam- 
mals.’’ Also, Fulton’s suggestion does 
not explain why the leucotomized pa- 
tient appears to have a diminished 
responsiveness to many other stim- 
uli besides noxious stimuli nor does 
it explain why the patient may state, 
when directly asked, that the “pain 
feeling’’ is the “‘same”’ but does not 
matter any more. 


” 
ness, 


The above i:.vestigators have em- 
phasized the destruction of the af- 
ferent projections to the frontal areas 
and have neglected the probable ex- 
tensive destruction of corticofugal 
fibers. In a postmortem investigation 
of six lobotomized patients, Yakov- 
lev, Hamlin, and Sweet (1950, p. 
328) found, that the frontopontine 
tracts were bilaterally and symetri- 
cally degenerated and that “the 
great frontal corticofugal pathway 
descending from the entire anterior 
pre-Rolandic half of the cerebral 
hemisphere was deprived of a large 
and important component.” They 
conclude with a statement that can- 
not be too much overemphasized: 


On the basis of this study jt seems to us that 
in the attempts made thus f° to correlate the 
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behavioral changes following frontal lobotomy 
with anatomy... the degeneration of an- 
terior thalamic radiations and nuclei has been 
stressed to the exclusion of the obvious de- 
generation of the far greater mass of efferent 
projections which connect frontal lobes to all 
the levels of the neuraxis . . . (p. 328). 

In line with this suggestion,'‘a few 
workers have attempted to formu- 
late the effects of leucotomy in terms 
of the destruction of efferent projec- 
tions. Bonner et al. (1952) speculate 
that if the connections between neo- 
cortex and archicortex are severed by 
prefrontal leucotomy “there would 
be less activation of the archicortical 
circuits which probably subserve 
emotional reactions and thus per- 
petuate suffering.”” Arnold (1955, p. 
154) hypothesizes that since the 
dorsomedial nucleus of the thalamus 
degenerates after prefrontal leucot- 
omy and since the frontal lobes ac- 
tivate the sympathetic centers in the 
posterior hypothalamus by connec- 
tions through this nucleus, ‘‘Anxiety 
is reduced because the excitation of 
sympathetic effectors is now pre- 
vented and with it a prolongation 
and intensification of the emotion.” 
However, since sympathectomized 
animals (Cannon, Newton, Bright, 
Menkin, & Moore, 1929) and sym- 
pathectomized human patients (Ray 
& Console, 1949; Grimson, Orgain, 
Anderson, Broome, & Longino, 
1949) apparently respond with nor- 
mal “emotion’’ and “anxiety” to 
many stimuli, it is doubtful that the 
prevention of sympathetic excitation 
is the mechanism involved in the re- 
duced “anxiety”’ or diminished reac- 
tivity of the leucotomized patient. 

In summary, although a number of 
neurophysiological mechanisms are 
apparently nonfunctional after pre- 
frontal leucotomy, we cannot state 
with any degree of certainty which of 
these mecha iisms are necessary for 
the maintenance of intractable pain. 
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Nor are we certain that destruction 
of one or more specific nuclei or nerve 
pathways is closely correlated with 
the effects of this operation. In fact, 
the evidence at present suggests that 
prefrontal leucotomy has different 
effects on. different patients even 
when apparently similar neural tissue 
is destroyed. To account for these 
differential effects we must have not 
only (a) much more preoperative 
data on each patient (e.g., the pa- 
tient’s personality characteristics, his 
general level of reactivity, the dura- 
tion of his pain syndrome) but also 
(6) much more specific postoperative 
data such as the specific tracts de- 
stroyed and the extent of postopera- 
tive hemorrhage, and (c) a better 
understanding of a number of phe- 
nomena which at present are not well 
understood, such as the “reintegra- 
tion” of function which may occur 
after destruction of neural tissue, the 
specific functions of the afferent and 
efferent fibers from the frontal areas, 
etc. When such specific data are 
available, we may be able to account 
both for the patient who is not helped 
by this operation and for the pa- 
tient who is not only relieved of pain 
but also of worry and concern about 
many situations including, in many 
cases, forthcoming death. 


THE PROBLEM OF CONGENITAL 
INSENSITIVITY TO PAIN 


A theory of pain must account for 
the “normal” response to noxious 
stimulation, for the alterations in 
this response by analgesics, place- 
bos, hypnosis, neurosurgical and other 
procedures, and for the antithesis of 
“normal” pain responsiveness, i.e., 
the problem of ‘‘congenital insensi- 
tivity to pain.”” At the present time 
we are far from a complete under- 
standing of the latter phenomenon. 
Nevertheless, within the limits of the 
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evidence as it now stands, certain sig- 
nificant factors stand out that should 
be emphasized. 

McMurray (1955) and Critchley 
(1956) have recently reviewed the 
handful (ca. 16) of well-documented 
cases of “congenital insensitivity.” 
McMurray’s (1950) case can be 
briefly summarized to indicate the 
more or less typical findings in these 
patients: 

A 22-year-old female college stu- 
dent, IQ 128, with no apparent per- 
sonality disorders. A history of con- 
sistent lack of pain responsiveness 
dating at least since early childhood. 
Extensive burns, frostbite, deep cuts, 
and other serious tissue damage ‘“‘had 
gone unnoticed or been looked on in- 
differently.” Her medical history in- 
cluded the incision of a large abscess 
over the occipital bone, osteomyelitis 
of the right calcaneus and of the left 
femur, tonsillectomy and adenoidec- 
tomy, and acute pyelitis, with no 
complaints of pain or tenderness. 
When subjected in the laboratory to 
such noxious stimuli as cold water 
at a temperature of 0° to 2° C., hot 
water at 49° to 51° C., and electric 
shock from an inductorium, she did 
not report pain, did not show wincing, 
withdrawal, or other indications of 
discomfort, and did not show any 
significant alterations in blood pres- 
sure, heart rate, or respiration. Ex- 
tensive neurological examination did 
not reveal any evidence of organic 
neurological disease. 

Although the other reported cases 
generally follow the above pattern, 
there are some differences: 2 patients 
(Ford & Wilkins, 1938, Case 2; 
Kunkle & Chapman, 1943) showed 
epileptic tendencies; 3 patients 
(Kunkle & Chapman, 1943; Arbuse, 
Cantor, & Barenberg, 1949; Cohen, 
Kipnis, Kunkle, & Kubzansky, 1955) 
showed increased diastolic and sys- 
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a “throbbing headache”’ after spinal 
anesthesia; and Jéquier and Deller’s 
(1956) patient reported ‘a little 
pain” when stimulated with a very 
hot object. 

As Critchley (1956) has noted, 
these Ss are not actually ‘‘insensi- 
tive’”’ to noxious stimulation; they 
can detect, identify, and localize 
noxious stimuli and can easily dif- 
ferentiate them from other stimuli. 
McMurray’s (1950) S states that, 
when a hypodermic needle is inserted 
into her skin, she feels it penetrating 
the tissue layers but does not “‘feel 
pain.” Stimuli such as pinprick and 
cutaneous shock and heat produce 
the report of a pricking or sharp qual- 
ity, but she does not describe this 
quality as “‘painful.” In fact, since 
this S can discriminate the sharp 
quality of heat stimulation, McMur- 
ray was able to establish in the pa- 
tient a “threshold” close to the nor- 
mal heat pain threshold. Similarly, 
Ford and Wilkins (1938), Kunkle 
and Chapman (1943), Boyd and Nie 
(1949), Jewesbury (1951), Westlake 
(1952), and Jéquier and Deller (1956) 
have reported that their Ss had no 
difficulty differentiating and localiz- 
ing a nociceptive stimulus; they 
could, for example, easily discrim- 
inate between the blunt and pointed 
end of a pin and had no difficulty 
localizing the pinprick. 

The available evidence indicates 
that m_ 1y, if not all, of these Ss have 
normal peripheral neural apparatus. 
Biopsy specimens from McMurray’s 
patient showed ‘“‘nerve fibers and free 
nerve endings present. ... No mor- 
phological features that would distin- 
guish them from the pain endings of 
normal subjects” (Feindel, 1953, p. 
402). Other investigators who at- 
tempted histological studies (Girard, 
Devic, & Garin, 1953; Madonick, 
1954; Cohen et al., 1955) also found 


nerve fibers in apparently normal 
pattern. 

In many, if not all, of these cases, 
the evidence indicates that no dis- 
tinct localized damage exists in the 
central nervous system. _ Investi- 
gators who performed extensive neur- 
ological examination of their Ss 
(Boyd & Nie, 1949; Arbuse et al., 
1949; McMurray, 1950; Jewesbury, 
1951; Rose, 1953; Madonick, 1954; 
Jéquier & Deller, 1956) report that 
all tests were essentially normal— 
normal reflexes, normal skull and 
spine X ray, normal pneumoenceph- 
alogram, normal electroencephalo- 
gram, etc. Arbuse et al. (1949) have 
emphasized that there is no indica- 
tion in their case, or in any other re- 
ported case, of a lesion in any specific 
part of the brain. Most investigators 
who have examined these Ss appear 
to be in agreement with De Jong’s 
(1949, p. 411) conclusion that the de- 
fective reaction is more likely due to 
a “generalized or diffuse develop- 
mental anomaly” and that it is 
highly doubtful that any “‘local le- 
sions”’ exist. 

In at least three of the reported 
cases the pain insensitivity was not 
due to an irreversible “anomaly.” 
Ford and Wilkins’ (1938) first case 
appeared to be insensitive to pain 
and readily submitted to many seri- 
ous noxious stimuli in the laboratory 
without signs of discomfort; later, 
however, he seemed to be afraid of 
“getting hurt,” refused to have a 
tooth extracted without an anes- 
thetic, and generally appeared to be 
becoming more concerned about po- 
tentially pain-producing stimuli. 
Similarly, during the first 2} years of 
life, Jewesbury’s (1951) fourth case 
did not show any signs of pain re- 
sponsiveness to a wide variety of in- 
juries—serious burns, bruises, bleed- 
ing fingers, etc. At two years of age 
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he had been reported in the press as 
“the child who knows no pain.” 
However, when examined at 3} years 
of age he showed normal pain re- 
sponses to all nociceptive stimuli. 
Rose’s (1953) case also followed a 
similar pattern; Rose reports that 
“his sensitivity to pain is becoming 
progressively nearer the normal and 
he now feels the minor injuries of 
boy’s life as well as any other child.” 

Since each of the reported cases ap- 
pears to differ in-some way from 
every other reported case, we cannot 
generalize from the above data. How- 
ever, we are probably safe in tenta- 
tively concluding that some of these 
Ss are able to respond to at least some 
nociceptive stimuli in the normal 
manner, i.e., with the “sensation of 
pain,”’ discomfort, and alterations in 
some physiological functions, even 
though they almost always fail to do 
so. Also, many, if not all, of these Ss 
can discriminate and localize noxious 
stimuli and easily differentiate these 
stimuli from heat, warm, pressure, 
and touch stimuli. But this ‘‘sens- 
ing” of a noxious stimulus is not 
“painful’’; very rarely is it associated 
with unpleasantness or discomfort. 
As Critchley (1956, p. 742) has 
pointed out: ““The most remarkable 
feature in this syndrome is a typical 
lack of conformity between the feel- 
ing of pain as a discriminative quality 
of sensation, and the registration of 
distress, either overtly or automati- 
cally.”” In fact, the available evi- 
dence suggests that some, if not many, 
of these Ss resemble in their pain re- 
sponsiveness the hypnotic ‘“anal- 
gesic” S and the restricted and iso- 
lated animals studied by Melzack 
and Scott (1957) more than they re- 
semble those rare patients with le- 
sions of the afferent apparatus who 
are unable to discriminate a nocicep- 
tive stimulus. The former phenom- 
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ena, summarized below, also indicate 
that “pain” in the sense of discom- 
fort and suffering is not necessarily 
present when noxious stimuli are dis- 
criminated, differentiaved, and local- 
ized. 


THE EFrect oF EARLY ISOLATION 
ON THE PAIN RESPONSE IN 
THE ADULT 


Melzack and Scott (1957) have 
provided much needed data concern- 
ing the effect of early isolation on 
pain responsiveness in the mature or- 
ganism. These investigators reared 
10 dogs in isolation ‘rom puppyhood 
to maturity in special cages which 
drastically limited both their over-all 
experience and their specific experi- 
ence with nociceptive stimuli. Com- 
paring the behavior of these re- 
stricted dogs with the behavior of 12 
normally reared dogs, they report the 
following: 

(a) In general, the 10 restricted 
dogs failed to show adaptive and in- 
telligent responses to noxious stimuli. 
Many of the dogs made no attempt to 
avoid a pinprick, a flame, or an elec- 
tric shock stimulus. Although some 
of the restricted dogs did learn to 
avoid these stimuli, they required 
many more trials than the control 
animals. As long as two years after 
release from isolation, many of the 
restricted dogs continued to show 
maladaptive behavior when given 
noxious stimuli. The investigators 
conclude that “it appears that the 
requisite experience must come at the 
correct time in the young organism’s 
life. During later stages of develop- 
ment, the experience necessary for 
adaptive, well-organized responses to 
pain may never be properly ac- 
quired” (p. 159). 

(6) The restricted animals ap- 
peared to be unable to localize the 
source of the noxious stimulus. Not 
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only were the stimuli “not ‘per- 
ceived’ as coming from the experi- 
menter’’ but the dogs also appeared 
to be “unaware that they were being 
stimulated by something in the en- 
vironment” (p. 158). 

(c) Although the restricted ani- 
mals may have “‘felt”’ the nociceptive 
stimuli “in some way,” they rarely 
showed discomfort or suffering: 


Their reflexive jerks and movements dur- 
ing pinprick and contact with fire suggest 
that they may have “felt something’’ during 
stimulation; but the lack of any observable 
emotional disturbance apart from these reflex 
movements in at least 4 of the dogs following 
pinprick and in 7 of them after nose-burning 
indicates that the perception of the event was 
highly abnormal in comparison with the be- 
havior of the normally reared control dogs. . . . 
The results suggest that the restricted dogs 
lacked awareness of a necessary aspect of nor- 
mal pain perception; the “meaning” of physi- 
cal damage or at least threat to the physical 
well-being (p. 159). 


Additional investigations are 
needed to determine the validity of 
the following hypothesis suggested by 
this study: some components of the 
normal pain response (local reflex 
movements and “the sensation of 
pain’’) do not require prior experi- 
ence with noxious stimuli; other com- 
ponents of the pain response (localiz- 
ing the stimulus, purposive with- 
drawal movements, and discomfort- 
suffering) require previous experience 
with such stimuli. 


HYPNOTICALLY-INDUCED 
“ANALGESIA” 


The experimental evidence, sum- 
marized by Weitzenhoffer (1953), in- 
dicates that, when given appropriate 
suggestions to induce “analgesia,” 
some “‘good”’ hypnotic Ss do not show 
a pain response to some noxious stim- 
uli, that is, they do not give a verbal 
report of pain, they do not withdraw 
from the stimulus, they do not show 
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discomfort by wincing, tremor, or 
restlessness, and they do not show 
significant alterations in blood pres- 
sure, heart rate, pulse rate, or res- 
piration. Dynes (1932) reported that 
following pinprick during hypnoti- 
cally-induced ‘‘anesthesia’’ seven 
“trained”’ hypnotic Ss denied that 
the stimuli were painful, did not show 
withdrawal or facial flinch, and 
showed little or no disturbance in the 
normal rate and rhythm of respira- 
tory and cardiac activity. Subse- 
quently, Dynes’ Ss were asked (by 
someone other than the experi- 
menter) to “fake a trance”’ during the 
following experiment but not to 
“enter hypnosis.” In this situation, 
pretending they were in trance, they 
showed all of the normal responses 
to the nociceptive stimuli. In a sim- 
ilar study, Sears (1932) recorded the 
responses of seven “good’’ hypnotic 
Ss to a sharp steel point pressed 
against the leg for 1 sec. with a pres- 
sure of 20 oz. Suggestions of anal- 
gesia were given for the left leg and 
the right leg was employed as a con- 
trol. When the stimulus was applied 
to the ‘“‘analgesic’’ left leg, the Ss did 
not show facial flinch or variations in 
respiration and the increased pulse 
rate, which normally follows nocicep- 
tive stimulation, was significantly 
decreased. However, they did show 
these responses when the stimulus 
was applied to the “‘normal”’ right leg. 
In further control experiments, when 
the Ss were asked to inhibit all reac- 
tions to the noxious stimulation, all 
Ss showed alterations in pulse and 
respiration. Doupe, Miller, and Kel- 
ler (1939) have in general confirmed 
these findings, reporting that their 
hypnotic Ss showed a slight altera- 
tion in respiratory rhythm, no sig- 
nificant change in pulse rate, and no 
facial grimace when multiple pin- 
pricks were applied to the ‘“anal- 
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gesic’’ arm.* Brown and Vogel (1938) 
also found that three Ss showed less 
variability in blood pressure, pulse, 
and respiration when nociceptive 
stimuli (lancet, thumb tack, and 
water at 49° C.) were applied to the 
“anesthetic” limb than when the 
same stimuli were applied to the 
“normal control” limb. Although 
they conclude that “physiological 
reactions to moderate and mild sen- 
sory stimuli may be affected by sug- 
gestion in the hypnotic state and by 
imagination in the waking state,” it 
is not clear from their report to what 
degree these responses were affected. 

Although the experimental studies 
generally report either a complete 
lack or a significant decrease in vaso- 
motor and respiratory alterations fol- 
lowing nociceptive stimulation dur- 
ing hypnotically-induced ‘“‘anal- 
gesia,’’ they report completely con- 
tradictory results with the galvanic 
skin response (GSR). Some investi- 
gators (Georgi, 1921) found that the 
GSR to noxious stimulation was com- 
pletely eliminated during hypnotic 
“‘anesthesia"’; others (West, Niell, & 
Hardy, 1952) concluded that it is at 
times significantly decreased over the 
normal and at other times completely 
eliminated; and still others (Levine, 
1930; Barber & Coules, 1959) re- 
ported that. the GSR to noxious 
stimuli is not significantly altered 
after hypnotic suggestions of anal- 
gesia. However, an extensive group 
of investigations indicate that the 
GSR is the least specific of all the 
physiological responses which may 


* Doupe et al. also found that, in compari- 
son with the normal limb, the hypnotically 
“anesthetic” limb showed a reduced vaso- 
constrictor response to pinprick. They are 
uncertain, however, whether this “residual 
response”’ is “of the nature of a spinal reflex" 
or due to “sub-conscious or co-conscious ac- 
tivities” on the part of the S. 
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follow noxious stimulation. Although 
blood pressure alterations, for ex- 
ample, are at times present when S 
is responding to nonpainful stimuli, 
some variation in blood pressure ap- 
pears to be always present when S 
does “feel pain” (Nafe & Wagoner, 
1938; Goetzi, Bien, & Lu, 1951). This 
is not true of the GSR, however. An 
S may show a GSR when he does not 
“feel pain’’ and he may not show a 
GSR when he does “feel pain.” 
Brown and Vogel (1938) demon- 
strated that hypnotic Ss often showed 
a GSR when there was no doubt that 
they did not “feel pain,” that is, when 
noxious stimuli were applied to an 
area made insensitive by novocain 
bleck. They write that “light appli- 
cation of the pin point [to the area in 
which novocain had been injected] 

. appreciated as touch, caused 
large galvanometer deflections” 
419). Along similar lines, Levine 
(1930) and Barber and Coules (1959), 
using hypnotic Ss, and Sattler (1943), 
using nonhypnotic Ss, found that 
Ss often show a GSR when they are 
told they are to be given a pain- 
ful stimulus but.are not given the 
stimulus. West et al. (1952) found 
that (a) the GSR showed a significant 
decrease over the control levels for 
all seven of their hypnotic Ss even 
when “there was no alteration in 
pain perception, according to subjec- 
tive reports,”’ and (6) during the con- 
trol periods a stimulus “evoking a 
pain of 6 or 7 dols” at times failed to 
produce a GSR. After a careful, 
long-term investigation designed to 
determine the relationship of the 
GSR to the pain response, Furer and 
Hardy (1950) concluded that the 
GSR is directly related to the 
“threat-content” of the stimulus and 
is not related to the “sensation of 
pain” as such. Following Furer and 
Hardy’s interpretation, we can con- 


/ 
‘ p. 
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clude that studies which have re- 
corded the GSR during hypnotically- 
induced “‘analgesia’’ indicate that to 
some hypnotic “analgesic’’ Ss the 
noxious stimuli are ‘‘threatening,” 
to others they are less ‘‘threatening” 
than during the control period, and 
to still others they are not “threaten- 
ing” at all. 

However, we cannot draw any con- 
clusions from the above studies as 
to the effectiveness of hypnotic pro- 
cedures when the stimuli are more 
severe and of longer duration. For 
this type of report we must turn to 
the clinical investigations. In gen- 
eral, the clinical reports suggest that 
hypnotic methods, with some patients, 
may be as effective as morphine and 
other opiates in minimizing patho- 
logical pain syndromes and in miti- 
gating or totally eliminating the dis- 
comfort-suffering component of the 
pain response during a variety of 
surgical procedures. A typical report 
of the surgical use of hypnotic tech- 
niques is Mason’s (1955) discussion 
of a case of mammaplasty: during the 
operation, which consisted of exci- 
sion of breast tissue, skin, fat, and 
complete reshaping of the breast, the 
patient “never showed signs of pain 
or seemed distressed” and the pulse 
and blood pressure showed very lit- 
tle, if any, alteration. Kroger (1957) 
has also reported four cases which 
are more or less typical of the surgical 
findings. The first case, a 20-yr.-old 
female, had ‘‘a fairly large tumor’”’ re- 
moved from the right breast without 
preoperative or operative medica- 
tion. She showed “‘no indication of a 
pain reflex at any time”’ and she was 
“fully aware of the entire surgical 
procedure.”” Another patient, who 
underwent Caesarean section and 
hysterectomy with hypnotic “anes- 
thesia,” “‘experienced no subjective 
discomfort and conversed with every- 
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body in the operating room. She was 
fuily conscious and was able to watch 
the birth of her baby. There was no 
discomfort when the baby was de- 
livered by forceps, or when the uterus 
was extirpated.’"’ Other studies, re- 
cently summarized by Barber 
(1958b), also report that hypnotic 
methods are successful with some pa- 
tients in minimizing or completely 
eliminating the discomfort-suffering 
component of the pain response dur- 
ing childbirth, terminal cancer, fatal 
burns, dysmenorrhea, and other pain 
syndromes. 

It should be emphasized, however, 
that in the more severe and intracta- 
ble pain syndromes, such as terminal 
cancer and spinal cord injuries, hyp- 
notic methods are reported to mini- 
mize discomfort and suffering; rarely, 
if ever, are these procedures reported 
to completely eliminate the total pain 
response to the ever-present noxious 
stimulus in the patient’s body. Dor- 
cus and Kirkner (1948) found that al- 
though hypnotic methods could min- 
imize discomfort in five cases of spinal 
cord injury—i.e., the patients re- 
ported less pain and requested a 
smaller amount of drugs—these 
methods were by no means effective 
in entirely eliminating the pain re- 
sponse. Similarly, Butler (1954) re- 
ported that hypnotic methods were 
effective with some patients in min- 
imizing discomfort during terminal 
cancer—the patients either required 
half of their usual amount of mor- 
phine or, in a few cases, did not re- 
quire any drugs for a period of time. 

The evidence available at present 
indicates that two objections which 
have been raised concerning the ef- 
fectiveness of hypnotic procedures in 
“relieving pain’ are not valid. Hull 
(1933) was of the opinion that hyp- 
notic Ss may state, after the experi- 
ment, that they did not “feel pain” 
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during the experiment because ame 
nesia has been suggested and they 
simply do not remember. However, 
in a number of recent studies (Rosen, 
1951; Mason, 1955; Kroger, 1957) 
posthypnotic amnesia was not sug- 
gested and the patients continued to 
insist that they had not “felt pain” 
even when they were perfectly able 
to recall the entire procedure. Others 
have objected that hypnotic Ss ac- 
tually “feel pain” but deny it (when 
questioned by the hypnotist) be- 
cause of their rapport or strong 
“transference”’ relationship with the 
hypnotist. This also seems doubtful. 
Whenever any of the above patients 
were questioned afterwards by dis- 
interested observers, they continued 
to vehemently deny “feeling pain” 
during the procedure (Marcuse & 
Phipps, 1956; Kroger, 1957). 

Before we can state what are the 
necessary and sufficient conditions 
for hypnotic “analgesia,” we need 
more extensive, controlled experi- 
ments utilizing a wide variety of 
noxious stimuli applied to visceral, 
somatic, and cutaneous structures. 
Recent investigators, however, have 
emphasized three conditions which 
may be among the necessary < »ndi 
tions for this phenomenon. 

First of all, it seems that the S must 
be a certain type of person who is 
able to become ‘“‘deeply hypnotized.” 
With few, if any, exceptions, investi- 
tors agree that these individuals (us- 
ually termed somnambulists) are a 
small minority—5 to 25%—of the 
population, at least in our culture 
(Weitzenhoffer, 1953, p. 59; Butler, 
1954; Mason, 1955; Kroger, 1957). 
The limited evidence available at 
present suggests that these indi- 
viduals are characterized by a num- 
ber of distinct “abilities.” Young 


(1928, p. 372) found that one or more 
following 


of the characteristics 
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showed themselves in all of his ‘‘best”’ 
Ss long before they were “hypno- 
tized’: ‘“‘deep abstraction, reverie 
amounting almost to ecstasy, putting 
oneself to sleep at will, actually hyp- 
notizing one’s self.’’ Similarly, Bar- 
ber (1958b, 1958c) found that all of 
his somnambulistic Ss had been able 
since childhood to go to sleep easily 
and quickly at anytime—day or 
night—and to concentrate on their 
work or studies by “blocking-out”’ 
irrelevant stimuli. ‘ 

What appears to be a second neces- 

sary condition for hypnotic ‘“‘anal- 
gesia’’ has been formulated by Leuba 
(1957) as follows: 
“There must be concentration on the ideas 
presented by the hypnotist and with a mini- 
mum of counter or critical thoughts; and a 
belief that what the hypnotist says will hap- 
pen, can actually happen, and will happen. In 
other words, there must be a set or attitude to 
accept the hypnotist’s statements completely 
and uncritically” (p. 37). 


Along similar lines, Kroger (1957, p. 
xi) has concluded from his extensive 
experience with hypnotic procedures 
that “when one wishes to perform 
major surgery under hypnoanesthesia 

. it is very important to get the 
patient to believe in the actuality of 
the trance state."” Recent evidence 
indicates that these statements may 
be valid. Barber (1957b) reported 
that a somnambulistic hypnotic S 
(who quickly and easily carries out 
all of the “complex” hypnotic behav- 
iors such as analgesia, age-regression, 
negative and positive hallucinations, 
etc.) becomes ‘‘unhypnotizable”’ 
when he no longer “‘believes” in hyp- 
nosis, i.e., when he concludes from 
his own reading, or from training in 
autohypnosis,’ that the hypnotist 
does not possess any special power or 
ability and that whatever occurs dur- 


™Shor, R. E. 
October, 1957. 


Personal communication. 
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ing the hypnotic situation is brought 
about primarily by the subject him- 
self. A number of investigators have 
also reported that somnambulistic Ss 
(in the “‘deepest stage of hypnosis”’) 
do not carry out ‘‘complex’”’ hypnotic 
behaviors such as color blindness 
(Erickson, 1939), age-regression 
(Orne, 1951), immoral or dangerous 
behavior (Young, 1952), and nega- 
tive hallucinations (Barber, 1958a) 
when the hypnotist simply gives 
them the appropriate suggestions; 
however, they do carry out such be- 
havior when the hypnotist manipu- 
lates the situation in such a way as to 
lead the Ss to “believe” that the sug- 
gestions are literally true statements. 

A third factor which seems to be 
closely related to the above, has been 
recently emphasized by physicians 
attempting to relieve the pain of 
terminal cancer or childbirth by hyp- 
notic methods; the patient must have 
confidence in his physician-hypnotist 
and the hypnotist must “give of him- 
seif’’ to the patient. In treating the 
pain of terminal cancer by hypnotic 
procedures, Butler (1954) saw his pa- 
tients at least daily and often two to 
four times a day. Whenever hypno- 
therapy was terminated for any 
length of time, the patients all showed 
a return of the original pain syn- 
drome. However, in the few cases 
when hypnotic procedures were dis- 
continued but the patient received the 
same amount of personal attention 
from the physician, the patients did 
just as well for one or two days as 
they did during “hypnosis.” Butler 
emphasizes that in treating the pain 
of terminal cancer by hypnotic meth- 
ods the hypnotist-physician “‘gives of 
himself to the patients. ... Even an 


hour’s treatment with a very sick pa- 
tient can produce an appreciable tir- 
ing of the hypnologist, and, as the 
sympathetic bond between the two 


449 


grows stronger, the hypnologist may 
even ‘feel’ the symptoms he is trying 
to eradicate from the patient”’ (p. 6). 
Along similar lines, Winkelstein (1958, 
p. 154) concluded, after using hyp- 
notic methods over a 2-yr. period 
with 200 of his obstetrical patients, 
that ‘‘the mental attitude of the pa- 
tient, the patient-obstetrician rap- 
port, and the confidence of the pa- 
tient in the procedure as well as in 
the accoucheur, seemed to be as im- 
portant factors as was the hypno- 
suggestion itself.’’ 
In summary, the evidence available 
at the present time indicates that 
when the hypnotist properly manipu- 
lates the situation some ‘“‘good”’ hyp- 
notic Ss show a mitigated pain re- 
sponse to some noxious stimuli,’ that 
is, (a) they do not show withdrawal 
or avoidance, (6) they report that the 
stimuli are not painful, (c) they do 
not show discomfort and (d) they 
do not- show physiological responses 
such as vasomotor and respiratory 
alterations (although they may or 
may not show galvanic skin re- 
sponses). The evidence also suggests 
that Ss who are able to carry out the 
above are “‘set’’ to accept the hyp- 
notist’s suggestions as literally true 
statements and have complete confi- 
dence in the hypnotist and in the 
efficacy of hypnotic procedures.® 


® It should be emphasized that the hypnotic 
“analgesic” S, like the leucotomized, nar- 
cotized, or congenitally insensitive patient, is 
able to discriminate, differentiate, and localize 
the noxious stimulus when asked to do so 
(Rosen, 1951). Although he can “sense” the 
stimulus, it does not arouse discomfort. 

® As will be pointed out below, the “pain 
relief’’ which at times follows the administra- 
tion of a placebo is also closely related to the 
S's belief or conviction that the “drug’’ has 
curative properties.. Apparently, at least some 
of the effectiveness of hypnotic ‘analgesia’ 
is due to a “‘placebo effect.” 

However, the hypnotic ‘analgesic’ S also 
resembles the patient who has received mor- 
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THE INCONSTANT PAIN THREHSOLD 


Recent studies indicate that both 
morphine and placebos can eliminate 
discomfort and suffering without al- 
tering the pain threshold and without 
affecting ‘“‘the sensation of pain.” 
Since many of the studies on placebos 
and analgesic drugs are intimately re- 
lated with the pain threshold studies, 
we shall first review the latter investi- 
gations. 

When the subject of pain was last 
reviewed in this journal (Edwards, 
1950) it appeared that Hardy, Wolff, 
and Goodell (1940) had established 
that the pain threshold was relatively 
constant in the same S at diflerent 
times. Using what was later to be- 
come known as the Hardy-Wolff- 
Goodell radiant heat technique’® and 
using themselves as Ss, these investi- 
gators reported that when pain thresh- 
old measurements were taken almost 
daily for nearly a year, the average 
threshold value was 232 / 
cm.* with a standard deviation of 
only +9 millicalories. In addition, 
they 


me./sec./ 


reported that all observations 
were within +12% of the mean. It 
also appeared that the same workers 


(Schumacher, Goodell, Hardy, & 
Wolff, 1940) had established that a 
large group of untrained Ss had ap- 
proximately the same pain threshold. 
Theyreported that theaverage thresh- 


phine or other opiates. As pointed out below, 
morphine apparently gives “pain relief” by 
bringing about “freedom from anxiety” or “a 
bemused state.”” These terms also appear ap- 
plicable to the “good” hypnotic S who be- 
comes relatively unconcerned about and 
“relatively inattentive to all stimuli except the 
words of the operator and stimuli to which 
the operator specifically directs his attention” 
(Barber, 1957a). 

To what extent hypnotic “analgesia” is due 
to each of these seemingly different mecha- 
nisms is a subject for further research. 

1 For a detailed descrption of this method 
see Hardy et al. (1952, pp. 67-85). 
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old for 150 untrained Ss was 206 
mce./sec./cm.? with a standard devi- 
ation of only +21 millicalories and a 
range extending only from 173 to 
232 millicalories. Subsequent investi- 
gations, however, have failed to con- 
firm both of the above conclusions; 
it now appears that there is a wide 
variation in pain threshold among a 
group of Ss and that the threshold 
is by no means consistent in the 
same S over time. 

Using the Hardy-Wolff-Goodell ra- 
diant heat technique, Chapman and 
Jones (1944) found that the pain 
thresholds of 200 Ss varied between 
—40 to +50% of the mean and 
Kuhn and Bromiley (1951) reported 
that the pain thresholds of 37 Ss 
ranged from 169 to 296 millicalories 
with a standard deviation of 31.9. 
Hall and Stride (1954), using a modi- 
fied Hardy-Wolff-Goodell technique, 
found that the pain threshold of 400 
psychiatric patients (neurotics, de- 
pressives, and schizophrenics) ex- 
tended “over almost the whole range 
of stimulus intensity’”’ with the mean 
at 260 millicalories and a standard 
deviation of 72+45. The depressives 
and schizophrenics reported pain at 
a uniformly high level of stimulus 
intensity while the anxiety neurotics 
consistently reported pain at low 
stimulus intensities. Since the pain 
threshold, but not the warmth thresh- 
old, could be easily altered by vary- 
ing the instructions, Hall and Stride 
suggest that pain threshold varia- 
tions are due to “central attitude or 
pain-conceptualization and not to 
differences in peripheral sensitivity.” 
Five additional studies, using the Har- 
dy-Wolff-Goodell technique, which 
also failed to find consistency in the 
pain threshold have been recently re- 
viewed by Beecher (1957). 

Other workers using other methods 
have also found wide variability in the 
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pain threshold. Although Hardy et 
al. (1940) had reported that all 
threshold measurements of their 3 Ss 
(themselves) fell within +12% of 
the mean, Lanier (1943), using an 
electric shock stimulus, found that 
the threshold of 15 college women 
showed a variation around the mean 
of —80 to +300%. He also found 
that some Ss showed a relatively 
constant threshold while others 
showed wide variations in their pain 
threshold at different times. Clark 
and Bindra (1956), using thermal, 
electrical, and mechanical stimuli, 
have demonstrated wide individual 
differences in the pain threshold of 
46 untrained Ss. They attribute these 
variations to “‘attitudinal’’ variables 


such as the definition of pain, set, 
anxiety, and timidity. 

After reviewing the many investi- 
gations in this area, Beecher (1957, 
p. 128) writes that “a survey of the 
abundant literature on the subject 


presented above forces one to con- 
clude that the pain threshold is not 
constant from one individual to an- 
other nor even in a given individual 
from one time to another.” Simi- 
larly, Kutscher and Kutscher (1957) 
conclude, after reviewing the litera- 
ture, that the pain threshold varies 
widely among human beings, pro- 
vided that a sufficiently large group 
of Ss is tested. 

The second conclusion that appears 
to have been established by these 
investigations is that the pain thresh- 
old can be easily influenced by vary- 
ing the instructions (Hall & Stride, 
1954), by a wide variety of “dis- 
tractions,”’ and by placebos, anal- 
gesics, and hypnosis. Wolff and Good- 
ell (1943) had earlier demonstrated 
that placebos, in some cases, elevated 
the pain threshold as much as 95%, 
that the distraction caused by retain- 
ing and repeating from 5 to 9 digits 
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raised the threshold as much as 45% 
and that “shallow hypnosis” ele- 
vated the pain threshold by 40%. 
Subsequent work on the effects of 
analgesics and placeboson pain thresh- 
old will be discussed in the following 
section of this paper. 

Kutscher and Kutscher (1957) 
have noted that the pain threshold 
can be significantly influenced by 
the operator administering the test. 
A report by Denton and Beecher 
(1949) indicates that this observation 
may be valid. Having failed to find 
any consistent effect of analgesic 
agents on pain threshold in trained 
subjects, these investigators re- 
quested the service of an individual 
who had had wide experience with 
the Hardy-Wolff-Goodell method. 
They found that this Operator re- 
ported consistent elevations in the 
threshold, after the administration 
of an analgesic drug, when he knew 
which drug—a placebo or an anal- 
gesic—had been administered; how- 
ever, when he did not know whether 
an effective drug or a placebo had 
been administered, he was unable to 
report any consistent threshold ele- 
vation. 

That the pain threshold can be 
readily influenced by a wide variety 
of factors is not surprising if we stop 
to consider that determination of the 
human pain threshold does not sven 
remotely resemble the determination 
of threshold responses of nerve fibers, 
nerve trunks, or other isolated physi- 
ological units; determination of the 
human pain threshold obviously re- 
quires judgment or interpretation on 
the part of the S. The S must inter- 
pret the stimulus in accordance with 
his concept of pain and interpreta- 
tion clearly depends on S’s previous 
life-history and especially his specific 
history in responding to similar or re- 
lated stimuli. In fact, the Hardy- 
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Wolff-Goodell method requires more 
than that S simply judge when he 
first becomes aware of a stimulus; 
he is required to determine when the 
stimulus first undergoes a qualitative 
change. Operationally, the Hardy- 
Wolff-Goodell “pricking pain thresh- 
old” refers to the S’s judgment that 
the feeling of warmth and heat has 
“swelled” and “drawn together” into 
a ‘“‘very small” and “barely percepti- 
ble prick’”’ at the “exact end of the 
3-sec. exposure to the stimulus” 
(Hardy et al., 1952, p. 81). This 
“pricking” feeling must be inter- 
preted by S as different not only 
from the warmth and heat which pre- 
cede it but also from the “burning”’ 
which may be simultaneously pres- 
ent. It would indeed be suprising if 
such an intricate judgment could not 
be influenced by a wide variety of 
factors, 


THe EFFECT OF OPIATES 
ON THE PAIN RESPONSE 


As pointed out above, morphine 
and other opiates give ‘‘pain relief” 
without necessarily altering the pain 


threshold. Although Wolff, Hardy, 
and Goodell (1940) reported that the 
pain threshold is consistently ele- 
vated after morphine, subsequent in- 
vestigations failed to confirm this 
conclusion. Andrews (1943), Chap- 
man and Jones (1944), Denton and 
Beecher (194°), Isbell (cited by 
Wikler, 1950), Javert and Hardy 
(1951), and Kuhn and Bromiley 
(1951) found that after an analgesic 
dose of morphine the pain threshold 
may be elevated, may be lowered, 
or may remain unchanged. 

A related line of research com- 
paring placebos and analgesic drugs 
arrived at similar results. Denton and 
Beecher (1949) found, using the 
Hardy-Wolff-Goodell method, that 
a placebo had the same effect on 
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pain threshold as an analgesic dose 
of morphine. Similarly, Birren, 
Schapiro, and Miller (1950) reported 
that a placebo (lactose) had the same 
effect on pain threshold as 0.6 gm. 
of acetylsalicylic acid and sodium 
phenobarbital. Isbell (cited by Wik- 
ler, 1950) also found no significant 
difference in the effect on the pain 
threshold when Ss received morphine 
and when they received a placebo 
but were told they were being given 
morphine. Beecher (1957) has re- 
viewed 10 additional investigations 
which also indicate that morphine 
and other opiates (a@) do not neces- 
sarily elevate the pain threshold when 
they give “pain relief’ and (0) if and 
when they do elevate the pain thresh- 
old, they do so to the same extent 
and possibly in the same way as 
placebos. , 

The evidence also indicates that 
opiates can give “‘pain relief’’ with- 
out altering “the awareness of pain”’ 
or “the pain sensation.”” Cattell 
(1943) has summarized the data in- 
dicating that ‘‘awareness of pain’’ is 
not necessarily altered by narcotics. 
Wolff et al, (1940, p. 677) have em- 
phasized that after morphine admin- 
istration “‘the pain sensation is per- 
ceived and is recognized as pain with 
no difficulties.” Apparently, “the 
sensation of pain,” in itself, is not 
necessarily “‘painful.’’ ‘The sensa- 
tion of pain’’ may be completely un- 
affected by morphine (and placebos, 
hypnosis, prefrontal leucotomy, etc.) 
and yet discomfort and suffering are 
no longer present. 

The ‘‘pain relief,” i.e., the mitiga- 
tion of discomfort and suffering, 
which follows the administration of 
morphine and other opiates appears 
to be one component of a more gener- 
alized effect on the patient which has 
been variously conceptualized as 
“freedom from anxiety,” ‘‘content- 
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ment,”’ and “‘a bemused state.”’ This 
viewpoint is perhaps best epitomized 
by Beecher (1957, p. 152) who writes 
after extensive clinical and experi- 
mental experience with analgesic 
drugs that “perhaps one can con- 
clude that the narcotics really alter 
pain perception very little but do 
produce a bemused state, comparable 
to distraction, which they can be 
‘alerted out of’ and will then report 
on the little altered pain perception.” 
Along similar lines, Hill, Kornetsky, 
Flanary, and Wikler (1952a) have 
hypothesized that the “pain relief” 
following morphine administration is 
a consequence of a more generalized 
effect which they term relief of 
“anxiety” or “fear of pain.” They 
tested this hypothesis by studying 
the effect of subcutaneous injection 
of 15 mg. of morphine on S’s ability 
to judge the intensity of electric 
shock stimuli under two conditions: 
(a) when Ss were made “anxious” 
by not “familiarizing them with the 
potentially fearinspiring experimental 
situation,” and (6) when “anxiety” 
was allayed by “reassurance, demon- 
stration, and explanation.” They 
reported the following: 

(a) Under conditions which promote anx- 
iety or fear of pain, subjects tend to overesti- 
mate the intensities of painful stimuli; (6) 
morphine reduces such anxiety; (c) under 
conditions in which anxiety is largely elimi- 
nated, little if any overestimation of the in- 
tensities of painful stimuli occurs; (d) mor- 
phine does not affect the ability of subjects to 
accurately estimate the intensities of painful 
stimuli when anxiety is dissipated (p. 479). 


Corroborative data were obtained 
in another study by the same group 
of investigators (Hill, Kornetsky, 
Flanary, & Wikler, 1952b). In an 
additional follow-up experiment, us- 
ing thermal stimuli, Kornetsky (1954) 
also confirmed these results and con- 
cluded that morphine appears to be 
effective as an analgesic agent only 
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when “anxiety” is present. 

In summary, the investigations on 
narcotics suggest a similar conclusion 
as the investigations on prefrontal 
leucotomy and hypnosis which were 
summarized in an earlier section of 
this paper: discomfort and suffering 
are not inevitably associated with 
noxious stimulation; they appear to 
be components of a secondary “reac- 
tion to’ the stimulus (which has 
been conceptualized as ‘‘anxiety”’ or 
“fear of pain”) which can be mini- 
mized or eliminated by opiates hyp- 
nosis, placebos, prefrontal leucotomy, 
and other procedures. 


THE PLACEBO EFFECT 


The effect of placebos on the pain 
response deserves further comment. 
Jellinek (1946) reported that 60% of 
199 patients with chronic headaches 
received “relief’’ from a placebo on 
one or more occasions. In extensive 


studies of severe, steady, postopera- 


tive wound pain, Beecher (1955) and 
his collaborators (Lasagna, Mosteller, 
von Felsinger, & Beecher, 1954) found 
that about 35% of their patients re- 
ceived “‘satisfactory” relief from a 
placebo." (‘‘Satisfactory relief’’ is 
defined by these workers as “50 per 
cent or more relief of pain at 45 and 
90 minutes after the administration 
of the agent.”) Houde and Wallen- 
stein (1953) and Keats (1956) have 
carried out similar studies and have 
confirmed the findings »f the Beecher 
group. 

How does a placebo relieve chronic 
headache or minimize the suffering 


This finding does not indicate that pla- 
cebos are only 35% as effective as morphine. 
Morphine, in maximum safe dosages, results 
in “satisfactory” postoperative pain relief in 
only 75% of the same group of patients 
(Lasagna & Beecher, 1954). The placebo, 
therefore, is about half as effective as mor- 
phine in the same situation and among the 
same patients. 
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associated with a postoperative 
wound? As a first approximation to 
an answer, it seems difficult to dis- 
agree with Wolf's (1950, pp. 106-108) 
conclusion: 

The above “placebo” actions depended for 
their force on the conviction of the patient 
that this or that effect would result. . . . The 
fact that “placebo effects” occur depends, of 
course, on the generalization established re- 
peatedly by numerous workers that the 
mechanisms of the human body are capable of 
reacting not only to direct physical and chemi- 
cal stimulation but also to symbolic stimuli, 
words and events which have somehow 
aquired special meaning for the individual. 


In general, the above studies and 
the many other studies on the effects 
of placebos on physiological func- 
tions and in psychotherapeutic situa- 
tions, reviewed by Beecher (1955), 
Rosenthal and Frank (1956), and 
Kurland (1957) indicate that the 
placebo reactor is responding to a 
“drug” which he believes has cura- 
tive properties. This belief appears 
to be a function of many factors: 
what the physician specifically tells 
the patient about the “drug,” the 
patient’s previous experience with 
drugs, his previous experience with 
physicians, his specific experience 
with the physician giving him the 
“drug,” etc. The placebo response 
may be viewed as a direct function 
of “the stimulus’; however, “the 
stimulus” is not the ineffective, inert 
compound but the entire situation 
which includes the “‘drug,”’ the words 
of the physician, and the patient’s 
previous experience with physicians 
and drugs. 

Placebo research is still in its in- 
fancy. As Kurland (1957) has pointed 
out, the effect of the placebo is 
usually stated in general terms, the 
duration of reactivity is usually not 
specified, and specific physiological 
measures are rarely reported. Studies 
of the differential effect of placebos 
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are also rare. Are some individuals 
more prone to respond to placebos? 
Lasagna et al. (1954) studied 27 post- 
operative patients with the Ror- 
schach, the TAT, the Wechsler-Belle- 
vue Vocabulary Subtest, and a ques- 
tionnaire filled out by the nurses on 
the wards. The 11 consistent placebo 
reactors differed from the 16 patients 
who never received pain relief from 
a placebo in a number of character- 
istics, among which were the follow- 
ing: 

The reactors were more productive of re- 
sponses, more anxious, more self-centered and 
preoccupied with internal bodily processes, 
and more emotionally labile. They are indi- 
viduals who seem more dependent on outside 
stimulation than on their own mental proc- 
esses. These processes tend to be less mature 
than in the case of the non-reactors. The 
reactors are in general individuals whose in- 
stinctual needs are greater and whose control 
over the social expression of these needs is 
less strongly defined and developed than in 
the non-reactors . . . (p. 775). 


However, Wolf, Doering, Clark, 


and Hagens (1957) contradict these 
conclusions: finding that intra-indi- 
vidual variations in response to place- 
bos are as great as interindividual 
variations, they conclude that the 
placebo reactor cannot be predicted 
from a knowledge of the S’s char- 


acteristics. An interesting field of 
research has been opened for further 
inquiry. 


CONCLUSIONS 


The investigations summarized 
above suggest the following conclu- 
sions which may be significant for a 
theory of pain: 

1. The generally accepted view, 
that “‘pain”’ has its ““own”’ peripheral 
receptors and its “own’’ pathways 
in the central nervous system, is mis- 
leading. Nociceptive stimuli activate 
various types of nerve fibers which 
travel in more than one pathway in 
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the spinal cord and brain stem and 
which project by thalamic and extra- 
thalamic pathways to wide areas of 
the cortex. 

2. The response to a nociceptive 
stimulus is apparently brought about 
when a spatiotemporal pattern of 
neural activity set off by the noxious 
stimulus reaches segmental and supra- 
segmental centers. The pattern of 
neural impulses set off by noxious 
stimuli differs from the neural pat- 
tern set off by other stimuli in that 
the relative number of fibers of differ- 
ent sizes activated differ, and the 
relatively different fibers activated 
carry impulses of different energy 
value, of different frequency, and of 
different duration. 

3. “Pain” in the sense of discom- 
fort and suffering is mot necessarily 
present when noxious stimuli are 
discriminated, differentiated, and lo- 
calized. The few cases which have 


been reported of “‘congenital insensi- 
tivity to pain’”’ suggest that an indi- 


vidual may be able to “sense” a 
noxious stimulus—i.e., may be able 
to discriminate and localize the stimu- 
lus and differentiate it from other 
stimuli—and yet not show with- 
drawal movements, physiological al- 
terations, or discomfort. Also, dis- 
comfort and suffering can be mini- 
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mized or totally eliminated in some 
Ss by placebos, opiates, prefrontal 
leucotomy, and hypnotic procedures 
without necessarily altering the ‘“‘sen- 
sation of pain” or elevating the pain 
threshold. 

4. The mitigation of discomfort- 
suffering by prefrontal leucotomy, 
opiates, and, to some extent, hyp- 
nosis, appears to be secondary to 
a more generalized effect of these 
procedures. Prefrontal leucotomy 
“alleviates worry and concern”’ and 
“relieves anxiety’’; morphine gives 
“freedom from anxiety” and brings 
about “contentment”? and “a _ be- 
mused state’; the hypnotic S is re- 
lieved of pain when he becomes 
“relatively inattentive and uncon- 
cerned about all stimuli to which the 
hypnotist does not specifically direct 
his attention.’’ These terms appear 
to refer to a common behavioral ma- 
trix: a mitigated “readiness to re- 
spond”’ to stimulation. Apparently, 
discomfort and suffering follow noci- 
ceptive stimulation when the S “‘at- 
tends to” and “reacts to” the stimu- 
lus. Minimize this readiness to re- 
spond and “the sensation of pain” is 
ne longer ‘‘painful’’; it can become an 
isolated ‘“‘sensation’’ unaccompanied 
by discomfort. 
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TOWARD STRENGTHENING THE CONTINGENCY TABLE 
AS A STATISTICAL METHOD! 
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Few statistical methods are better 
known to research workers in the be- 
havioral sciences than the simple, or 
two-way, contingency table with its 
corresponding chi square test for as- 
sociation between the two classifica- 
tions. It has served and will serve a 
unique function in research by facili- 
tating the analysis of data, either 
when we have little knowledge of the 
underlying quantitative properties or 
rather when we deliberately choose 
the method for a cursory analysis. 


Also, there are, perhaps, instances © 


when it provides the only method of 
analysis. Kelley (1947, p. 311) has 
listed the conditions under which 
data are placed in_ categories: 
(1) when a quantitative relationship 
in classes is not known to exist; 
(2) when the quantitative relation- 
ship is only vaguely surmisable; and 
(3) when the known quantitative re- 
lationship between classes is neg- 
lected because the more primi- 
tive and simple qualitative methods 
would seem to suffice.” 

Despite the widespread use of con- 
tingency analysis during its first four 
decades, the method until recently 
was applicable only to limited kinds 
of research problems and data. Since 
World War II many contingency 
techniques have been developed. It 
is the purpose of the present paper to 
describe some of these techniques 
briefly and to show how they over- 
came problems which have limited 


1A preliminary version of this paper was 
read before the Division on Evaluation and 
Measurement (Div. 5) of APA in New York, 
September 1957. 


the usefulness of the contingency 
method. Problems to be considered 
are concerned with such aspects as 
small samples, indices of relationship, 
specification of hypotheses, higher- 
order interactions, and computation- 
al procedures. 


SMALL SAMPLES 

One problem arises in analysis of 
contingency tables when the data 
constitute a “small sample.” The 
statistical theory presupposes “large 
samples’ which are not always con- 
veniently available. The definition 
of ‘‘small sample” has been arbitrary. 
Karl Pearson suggested that any ex- 
pected cell frequency below 10 is 
small, while Fisher set 5 as the limit. 
The typical solution to this problem 
has been to pool rows or columns so 
as to eliminate small expected fre- 
quencies or to use Yates’ correction 
(Yates, 1934) for a 2X2 table. 

Cochran (1952) believes that no 
rule of thumb is entirely adequate, 
and he indicated that, in a 2X2 
table, the magnitude of all four ex- 
pected frequencies affect the quality 
of the approximation. His paper 
clarifies the choice of tests when one 
has small samples or small cell fre- 
quencies. Since his rules shed new 
light on proper decisions to be made 
in analyzing small sample data and 
since they have not been readily 
available to behavioral researchers, 
they are given here in full. 


Summary Recommendations for the 
Use of X* 

I. Attribute data. 
to us in grouped form. 


The data comes 
Pooling of 
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classes is considered undesirable be- 
cause of loss of power. 

1. The 2X2 table. Use Fisher's 
exact test (a) if n<20, (b) if 20<n 
<40 and the smallest expectation is 
less than 5. Mainland’s tables. . 
are helpful in all such cases. If 
n>40, use X?, corrected for conti- 
nuity if the smallest expectation is 
less than 5. 

2. Tables with degrees of freedom 
between 2 and 60 and all expecta- 
tions less than 5. If m is so small that 
Fisher's exact test can be computed 
without excessive labor, use this. 
Otherwise use X?, considering wheth- 
er this needs correction for-continuity 
by finding the next largest value of 
X?. 

3. Tables with degrees of freedom 
greater than 60 and all expectations 
less than 5. Try to obtain the exact 
mean and variance of X* and use the 
normal approximation to the exact 
distribution. 

4. Tables with more than 1 df and 
some expectations greater than 5. 
Use X?* without correction for con- 
tinuity. 

Il. Continuous data. The data 
must first be grouped. Use enough 
cells to keep the expectations down 
to the levels recommended by Wil- 
liams (12 per cell for n=200, 20 per 
cell for »=400, 30 per cell for 
n=1,000). At the tails, pool (if nec- 
essary) so that the minimum expecta- 
tion is 1. 

It should be noted that Cochran’s 
n is the total number of cases in the 
contingency table. The symbol X? is 
what Cochran has suggested for the 
value of chi square obtained by sub- 
stituting empirical data in the formu- 
la. The Greek symbol ‘‘x’” is reserved 
for the tabled values given by the 
theoretical distribution. 

Fisher (1946) has shown how a 
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test of significance using the exact 
probability of an observed table and 
certain other configurations of cell 
frequencies can be applied to a 2X2 
table with small or zero frequencies. 
Freeman and Halton (1951) have ex- 
tended the principle to any number 
of attributes and any number of cate- 
gories per attribute. In general, both 
methods consist of assuming the bor- 
der totals fixed, considering only rela- 
tionships internal to the contingency 
table, considering every possible ar- 
ray of cell frequencies with the given 
border totals, and applying a test of 
significance as follows: (a) all arrays 
subject to the same general condi- 
tions as observed (i.e., the same bor- 
der totals); (6) the corresponding a 
priori probabilities are calculated by 
means of the appropriate probability 
expression; (c) the values of the a 
priori probabilities smaller than or 
equal to the probabilities of all ar- 
rays which are a priori as probable 
as, or less probable than, the ob- 
served array; (d) all probabilities 
satisfying the conditions in (c) are 
summed to yield the probability of 
obtaining an array as probable as or 
less probable than the observed ar- 
ray. 

Fisher's technique has appeared in 
a number of textbooks to date, but 
the technique of Freeman and Hal- 
ton seems not to have caught on. 
The computational labor in either 
technique is tedious, since it involves 
the quotient of the products of sets 
of factorials. However, by using ap- 
propriate tables which are based up- 
on Fisher’s formula and which yield 
the approximate significance level for 
2x2 data of various sample sizes 
and marginal totals, one may save 
much of the labor. Such tables rang- 
ing collectively up to sample sizes 
of 50 have been published by Arm- 
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sen (1955), Finney (1948), Latscha 
(1953), and Mainland (1948). 


INDICES OF RELATIONSHIP 


We sometimes wish to express the 
degree of association in a two-way 
contingency table with a significant 
chi square. We would prefer a coeffi- 
cient similar to that for product- 
moment correlation. Some problems 
have arisen in attempts to develop 
such indices. Some that have been 
deviged are the coefficient of mean 
square contingency, the phi coeffi- 
cient, tetrachoric correlation, the 
point-biserial coefficient, the coeff- 
cient of association, and the coeffi- 
cient of colligation. 

Inferences about the degree of as- 
sociation based upon the numerical 
size of a coefficient can be mislead- 
ing. A number of authors (Guilford, 
1936; Johnson & Jackson, 1953; Ken- 
dall, 1947) have called attention to 
the fallacies of such inferences. In 
general, the coefficients often fail to 
satisfy the desiderata of Kendall 
(1947, p. 310), namely, that “‘(a) it 
shall vanish when the associations 
are independent; (b) it shall be +1 
when there is complete positive as- 
sociation and —1 when there is com- 
plete negative association; (c) it 
should increase as the frequencies 
proceed from dissociation to associ- 
ation.”” In using these indices, one 
does not have the same intuitive feel- 
ing for strength of relationship as in 
the case of product-moment corre- 
lation. The latter has a stable, com- 
prehensive frame of reference to aid 
in the interpretation. Also, the 
sampling distribution of the statistic 
is well known. 

Kendall (1947) pointed out that 
there is a distinction between the 
concepts of association and correla- 
tion. For example, it is possible to 
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have strong association between two 
variables, when the correlation be- 
tween them is zero. The discrepancy 
comes from the fact that the two 
types of conclusions arise from two 
types of hypotheses. In testing for 
association, we consider all types of 
departures from independence; in 
testing for correlation we consider 
a much more limited kind of alterna 
tive hypothesis. 

All of the indices previously men- 
tioned, other than product-moment 
correlation, are subject to criticism. 
Let us single out one of them to illus- 
trate some weaknesses of such de- 
vices. The coefficient of mean-square 
contingency, C, is purported to be 
comparable to the Pearson product- 
moment correlation coefficient, 1, 
under some conditions. Guilford 
(1936, p. 357) has pointed out that 
C becomes identical with r under the 
following conditions: “(a) The vari- 
ables are of the continuous type; 
(b) N is large; (c) the number of 
classes is sufficient to overcome errors 
of grouping; and (d) the distribu- 
tions are normal.”” A number of 
weaknesses can be pointed out in at- 
tempts to meet these assumptions. 
Regarding (a), some data are either 
incapable of expression in terms of 
continuous scales or are difficult to 
quantify. Indeed, some data are in- 
herently qualitative and by defini- 
tion are nonquantifiable. Regarding 
(b), it is not always feasible to collect 
a large sample. Regarding (c), the 
number of classes used in practice 
will rarely be large enough so that C 
approaches 1.00 (the upper limit of 
the product-moment correlation co- 
efficient). One formula yields the 
value of the upper limit of C for a 
tXt table. For example, it has been 
shown that in a 2X2 table, C cannot 
exceed .707; in a 4X4 table, C can- 





464 


not exceed .866; in a 5X5 table, C 
cannot exceed, .894; and in a 1010 
table, C cannot exceed .949. Regard- 
ing (d), it is not always convenient 
or even possible to have both the dis- 
tributions normal. 

Rather than computing the mean- 
square contingency, one could con- 
solidate rows and columns of a larger 
table into a 2X2 table and compute 
any of several indices for a fourfold 
table. However, information is thus 
wasted, and the indices themselves 
leave something to be desired. 

One new solution for the problem 
is the use of scores for rows and col- 
umns and the calculation of regression 
coefficients and correlation coeffi- 
cients from such scores (Cochran, 
1954; Yates, 1948; Williams, 1952). 
Such techniques result in more sensi- 
tive tests for alternative hypotheses. 
The general procedure is to first test 
the hypothesis of independence by 
the over-all chi square test. If the 


hypothesis is rejected, and one con- 


cludes that association does, in fact, 
exist, he may use scoring methods to 
partition out the portion of the as- 
sociation which may be explained by 
correlation or regression. It is easiest 
to explain association as being due to 
the linear correlation of two under- 
lying variates. In some cases, it may 
be necessary to resort to one or more 
additional pairs of variates as mani- 
fested by new sets of row and column 
scores. 

Two kinds of scores are feasible for 
contingency tables, arbitrary, or a 
priori, scores, and empirical scores. 
Arbitrary scores are chosen along 
some convenient scale according to 
a knowledge of the kind of data it- 
self. The use of such scores serves to 
make some persons uncomfortable. 
However, Cochran (1954, p. 436) de- 
fends the use of arbitrary scores when 
they have embodied “the best insight 
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available about the way in which the 
classification was constructed and 
used.”” Furthermore, he pointed out 
that ‘‘any set of scores gives a valid 
test, provided that they are con- 
structed without consulting the re- 
sults of the experiment.”’ He goes on 
to say that “If the set of scores is 
poor, in that it badly distorts a nu- 
merical scale that really does under- 
lie the ordered classification, the test 
will not be sensitive.” 

Empirical scores are derived from 
the data themselves by statistical 
computation so that the correlation 
between row and column scores is a 
maximum. Such scores are optimal 
in the sense that they yield the high- 
est value possible for any set of 
scores that could be chosen. Thus, 
they are an improvement over arbi- 
trary scores. The procedures for cal- 
culating empirical scores grew out of 
work begun by Fisher (1946) who 
considered contingency tables from 
the point of view of discriminant 
analysis. Suppose that we wish to as- 
sign scores to rows and columns. 
What are the best scores to assign so 
that a linear function of row and col- 
umn scores will best differentiate the 
classes determined by the columns, 
and vice versa? This turns out to be 
a problem in maximizing the corre- 
lations between the scores, and the 
required correlations are those known 
as “canonical” in the sense of Hotel- 
ling (1936). Work in this area was 
continued by Maung (1941). The 
methods were applied to a practi- 
cal problem of quantifying letter 
grades of college students by John- 
son (1950). Bock (1957) showed that 
the empirical scoring scheme of Wil- 
liams is similar theoretically to the 
techniques of Maung (i941), John- 
son (1950), and Guttman (1941). He 
gives the name “optimum scaling”’ 
to the general theory and shows rela- 
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tively simple computational proce- 
dures for solving the necessary ma- 
trices. His approach to scaling is par- 
ticularly appropriate if the data are 
to be used in analysis of variance or 
multivariate analysis. 

Several of the above references 
(Bock, 1956; Bock, 1957; Cochran, 
1954; Fisher, 1946; johnson, 1950; 
Williams, 1952) have described ways 
of testing concordance of scores. In 
testing concordance, one asks two 
questions: First, of the total depar- 
tures from expectation, how much 
can be explained by a set of scores, 
and second, how much is not ex- 
plained by the set of scores? Thus, 
we may discover whether any given 
set of scores is sufficient to explain 
a significant amount of the associa- 
tion between two classifications and 
whether only one set of scores is suf- 
ficient. In answering these ques- 


tions, we have access to partitioning 
of chi square or analysis of variance 


techniques. 

In summary, arbitrary scores can 
be useful in instances in which one is 
familiar with the underlying quanti- 
tative basis of the classifications and 
when one wishes to save computa- 
tional labor at some loss in accuracy; 
on the other hand, empirical scores 
might be used when one is unfamiliar 
with the underlying quantitative 
basis or wishes a more accurate set of 
scores. 

Illustrations of scoring techniques 
as applied to data in the behavioral 
sciences have been given by Mayo 
(1957, 1958). 

Stuart (1953) devised a correlation 
coefficient for two-way tables which 
is a variety of Kendall's tau. His 
formulas, however, do not require 
scores for rows and columns as do 
those of Cochran, Yates, and Wil- 
liams. Rather, his coefficient de- 
pends only on ordinal properties and 
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was offered as a device to measure 
strength of association. He showed 
how the existing theory vf the coef- 
ficient may be used to estimate the 
population association, to set confi- 
dence limits for it, and also to test 
the differences in the coefficients cal- 
culated for two contingency tables. 
A general review of the methodol- 
ogy of measures of association for 
contingency tables with two or more 
attributes, together with a clarifica- 
tion and discussion of some of the 
underlying concepts was given by 
Goodman and Kruskal (1954, 1959). 


SPECIFICATION OF HYPOTHESES 
UNDER TEST 


The usual chi square test of inde- 
pendence between two classifications 
can be very useful in the exploratory, 
or pilot, stages of research, when one 
does not or cannot specify the alter- 
native hypotheses. For nonsignifi- 
cant results, one probably would not 
inquire further; however, once signifi- 
cance is demonstrated, it is well to in- 
quire as to what alternative hypothe- 
ses might be plausible and to test 
these empirically. For example, if 
one were interested in explaining as- 
sociation by assuming linear correla- 
tion between two underlying quanti- 
tative variates, the coefficient of 
mean-square contingency would be 
misleading. This coefficient is based 
upon obtained chi square which sub- 
sumes all forms of departure from 
expectation, rather than departures 
due to linear correlation which are 
only one kind of departure. The 
scoring techniques previously men- 
tioned constitute one solution to the 
problem of testing more specific al- 
ternative hypotheses. In addition to 
testing for significant regression co- 
efficients and for correlation, one 
may also test for homogeneity of 
row or column means, and for differ- 
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ences between pairs of row or column 
means. The analogy to the ¢ test and 
analysis of variance is apparent here. 

Distinction among three different 
kinds of sampling processes which 
might have yielded the same 2X2 
table of data was made by Barnard 
(1947) and Pearson (1947). These 
authors maintained that the abstract 
configuration of a given 2X2 table 
could have meaning for at least three 
different classes of empirical data, 
depending upon the sampling proc- 
ess and the assumptions. Barnard 
used three kinds of “urn experi- 
ments’ as models and called the 
three classes the (a) 2X2 Independ- 
ence Trial, (6) 2X2 Comparative 
Trial, and (c) Double Dichotomy. 
He also maintained that Fisher's ex- 
act formula appiied only to the 2X2 
Independence Trial and presented 
different formulas for the two classes. 
Pearson designated the first two 
classes of Barnard as Problem I and 
Problem II, respectively. It was 
pointed out, however, that for large 
samples, the results tend to approach 
each other asymptotically. A discus- 
sion of the theory of the power func- 
tion, computational formulas for the 
function and some illustrative tables 
of the function were presented for 
Problem I by Pearson and Merring- 
ton (1948) and for Problem II by 
Patnaik (1948). 

Cochran (1950) has given the 
theory and application of a test of 
significance for a 2X2 contingency 
table in which there is correlation be- 
tween the observations themselves in 
the cells. Such a situation occurs in 
practice when the same individuals 
are observed under different treat- 
ments. 

Snedecor (1946) has described and 
illustrated a technique of testing the 
discrepancies of several 2 X 2 samples 
which represent replications of the 
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same experiment in regard to the evi- 
dence which they furnish regarding 
the hypothesis under test. His illus- 
tration was for a single attribute with 
expected frequencies given a priori. 


HIGHER-ORDER INTERACTIONS 


Interactions among three or more 
classifications are often of intrin- 
sic interest in contingency analysis. 
However, most of the development 
of concepts for understanding and 
techniques for analyzing higher-order 
interactions has come about rather 
recently. 

Simpson (1951) pointed out the 
possibilities for fallacious conclusions 
that might be drawn from data in 
two-way form where certain effects 
are obscured, while if the data had 
been classified in a three-way form, 
the covert relationships would have 
shown up. The same effect applied to 
test item interactions has been called 
““Meehl’s Paradox”’ by Fricke (1956). 
The effect also appeared in a descrip- 
tion of latent structure analysis by 
Lazarsfeld (1954). The analogue for 
continuous variables is the well 
known argument against doing sepa- 
rate ¢ tests rather than a single fac- 
torial experiment. 

At present, there are available 
three approaches toward the assess- 
ment of higher-order interaction. It 
is not clear just how these three are 
related, or whether they are independ- 
ent approaches to the same _ prob- 
lem. There appears to be much to be 
done by both mathematical statis- 
ticians and by the applied researcher 
in clarifying the distinctions among 
these three techniques. 

The simplest case of higher-order 
interaction is that given in the 
2X22 table. A test for this 
case was first described by Bartlett 
(1935), who wrote the general formu- 
la for a cubic equation in which the 
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independent variable x is the devia- 
tion of each observation in the 
2X22 table from the corresponding 
expected values. Having solved the 
cubic equation for x, it was easy to 
compute chi square by means of a 
formula which he gave. Bartlett also 
treated more complex tables of more 
than three dichotomized attributes. 
Norton (1945) maintained that the 
calculation of chi square for the com- 
plex table of dichotomized attributes 
is purely a computational difficulty. 
He presented the algebraic model 
and an approximate method of com- 
puting chi square for such a table. 
However, Norton’s method is com- 
putationally tedious. Kastenbaum 
and Lamphiear (1959) demonstrated 
an iterative technique for solving the 
general three-way table which, while 
practical for a desk calculator, is par- 
ticularly well suited for modern high- 
speed computers. It is of interest to 


note that a general computer pro- 


gram covering certain selected cases 
of a three-way table up to size 
55X16 is available at Oak Ridge 
National Laboratory. Illustrations 
of 2X2 X2 problems have been given 
from biometrics by Snedecor (1946) 
and from the behavioral sciences by 
Mayo (1957). 

Another technique for testing high- 
er-order interactions utilizes approxi- 
mation by means of the likelihood 
ratio criterion and has been described 
and illustrated by Mayo (1957). In 
this technique, one can test a number 
of different kinds of higher-order in- 
teractions. For example, in a single 
four-way table, in addition to the six 
simple interactions of pairs of attri- 
butes, one may test 24 different null 
hypotheses about higher-order inter- 
actions, or a total of 30 null hypothe- 
ses for the table. Thus, one may 
test (a) mutual independence among 
all four attributes; (6) mutual inde- 
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pendence among any three attri- 
butes; (c) independence between any 
two attributes; (d) independence be- 
tween one attribute and a combina- 
tion of the remaining three; (e) inde- 
pendence between one attribute and 
a combination of the remaining two; 
and (f) independence between a com- 
binatioa of any two attributes with a 
combination of the other two. 

A third technique for testing high- 
er-order interactions was given by 
Lancaster (1951). His estimate of 
higher-order interaction is based up- 
on the partition of chi square. It is 
applicable whether the parameters 
used are given a priori or are esti- 
mated from the data. It is also gen- 
eral for any number of attributes 
and any number of categories. Al- 
though Lancaster's component for 
interaction is different from Bart- 
lett’s, he shows that they are asymp- 
totically the same. Lancaster's meth- 
od has the advantage of being com- 
putationally simpler, although it is 
not made clear just what hypotheses 
are being tested. 

Simpson (1951) has clarified the 
interpretation of interaction in con- 
tingency tables to some extent; he 
has compared the specific interpreta; 
tions of Bartlett and Lancaster; he 
has also pointed out some pitfalls to 
be avoided in the interpretation of 
interactions. Illustrations of higher 
order interactions from the behavior- 
al sciences have been given by Mayo 
(1957) and by Sutcliffe and Haber- 
man (1956). 


COMPUTATIONAL PROCEDURES 


The computational labor involved 
in applying some analyses to contin- 
gency data has been prohibitive; ex- 
amples are data involving a large 
number of cells, higher-order inter- 
actions, exact tests, and iterative 
procedures for a series of like data 
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such as item analysis or large scale 
questionnaire interpretation. The 
problem of computational labor has 
been attacked in a number of ways. 
For Fisher’s exact test, the tabled 
probabilities for a great many 2X2 
configurations and sample sizes by 
Armsen, Finney, Latscha, and Main- 
land have already been mentioned. 
Also, for the usual chi-square test 
there are a number of formulas which 
do not require the computation of ex- 
pected frequencies as an intermedi- 
ate step. Such a formula for the 2X2 
case is well known and has appeared 
in a number of textbooks; the formu- 
la for the r X 2 case is also well known; 
it sometimes goes under the name of 
the “Brandt-Snedecor” formula and 
has also appeared in a number of text- 
books. A similar formula for the 
r Xs case is less well known. To the 
author's knowledge, it has not ap- 
peared in a textbook, although it was 
published in two rather specialized 
journals (Carroll & Bennett, 1950; 
Leslie, 1951). It has been known by 
some research workers at universities 
and in military research; however, it 
does not seem to be as generally 
known as the other formulas for chi 
square. A computing routine for the 
r Xs formula was described by Mayo 
(1959). In one case, use of the formu- 
la reduced the number of machine 
and pencil operations by one half. 
An approximate graphical tech- 
nique for determining significance 
level for the 2X2 table by the simple 
addition and subtraction of cell fre- 
quencies was described by Trites 
(1957). The sample sizes tabled have 
a lower limit of 40, which is approxi- 
mately equal to the upper limit for 
the exact functions previously re- 
ferred to, while the upper limit is 200. 
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To use this test, one must draw inde- 
pendent samples; for maximum use- 
fulness, the two samples should be of 
equal size, although the author does 
show how some cases can be handled 
in a more approximate fashion. 
Another approximate, graphical tech- 
nique was devised by Bross and 
Kasten (1957); it does not require 
equal samples and is amenable for 
cases in which one column total is as 
large as 49. 

With the advent of electronic com- 
puters, one should find that some old- 
er, formerly prohibitive techniques 
will become feasible and probably 
newer computing programs will be- 
come available. 


SUMMARY 


The contingency principle for clas- 
sifying, analyzing, and interpreting 
categorical data has been well known 
by research workers for several dec- 
ades. Not until the last decade, 
however, has it realized more of its 
potential usefulness as it has been 
applied to a wider range of data. 
Some problems inherent in the fur- 
therence of its usefulness were dis- 
cussed as well as solutions for these 
problems. 

The analytical techniques treated 
here and those additional ones sure 
to come in the near future promise: 
(a) improved interpretation of con- 
tingency data of the usual kinds; 
(6) means of quantifying qualitative 
data so as to provide additional vari- 
ables for research investigations; and 
(c) contributions to both theory and 
practice in configural scoring and pat- 
tern analysis problems, when one is 
interested in higher-order interac- 
tions. 
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ERRATUM 


The study attributed to Goldstein and Sheerer (1941) by 
Chown (Rigidity—A Flexible Concept, Psychol. Bull., 1959, 56, 
p. 202) is in fact work by L. D. Goodstein (1953) as properly 
listed in the bibliography at the end of the article. 
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APA Members and Journal Subscribers— 
Are you going to move? 


If you move— 


your journals will not follow you from your old address :o your new one 


When you move— 
notify the APA Subscription Office 


Formerly, journals that could not be delivered because subscribers had not notified the 
APA of a new address were reclaimed by the APA, and the journal was remailed to the 
subscriber at his new address. This was always expensive. Recent changes in the postal 
laws and regulations have made the expense prohibitive. Undeliverable copies are now 
destroyed by the Post Office. Subscribers who do not receive a journal because of an 
address change are charged the regular single issue price for a replacement copy. 


So—when you move— 
Notify the postmaster at your old address and guarantee that you will pay the forwarding 


postage. 


Notify the APA Member Subscription Department as early as possible—by at least the 
tenth of the month preceding the month when the change should take effect. 
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